FIG. 7



1 CCCCAAAACC CCAAAACCCC AAAACCCCTA TAAAAAAAGA AAAAATTGAG 

51 GTAGTTTAGA AATAAAATAT TATTCCCGCA CAAATGGAGA TGGATATTGA 

101 TTTGGATGAT ATAGAAAATT TACTTCCTAA TACATTCAAC AAGTATAGCA 

151 GCTCTTGTAG TGACAAGAAA GGATGCAAAA CATTGAAATC TGGCTCGAAA 

201 TCGCCTTCAT TGACTATTCC AAAGTTGCAA AAACAATTAG AGTTCTACTT

251 CTCGGATGCA AATCTTTATA ACGATTCTTT CTTGAGAAAA TTAGTTTTAA

301 AAAGCGGAGA GCAAAGAGTA GAAATTGAAA CATTACTAAT GTTTAAATAA

351 AATCAGGTAA TGAGGATTAT TCTATTTTTT AGATCACTTC TTAAGGAGCA

401 TTATGGAGAA AATTACTTAA TACTAAAAGG TAAACAGTTT GGATTATTTC
451 CCTAGCCAAC AATGATGAGT ATATTAAATT CATATGAGAA TGAGTCAAAG

501 GATCTCGATA CATCAGACTT ACCAAAGACA AACTCGCTAT AAAACGCAAG
551 AAAAAGTTTG ATAATCGAAC AGCAGAAGAA CTTATTGCAT TTACTATTCG 
601 TATGGGTTTT ATTACAATTG TTTTAGGTAT CGACGGTGAA CTCCCGAGTC 
651 TTGAGACAAT TGAAAAAGCT GTTTACAACT GAAGGAATCG CAGTTCTGAA 

701 AGTTCTGATG TGTATGCCAT TATTTTGTGA ATTAATCTCA AATATCTTAT
751 CTCAATTTAA TGGATAGCTA TAGAAACAAA CCAAATAAAC CATGCAAGTT

801 TAATGGAATA TACGTTAAAT CCTTTGGGAC AAATGCACAC TGAATTTATA
851 TTGGATTCTT AAAGCATAGA TACACAGAAT GCTTTAGAGA CTGATTTAGC

901 TTACAACAGA TTACCTGTTT TGATTACTCT TGCTCATCTC TTATATCTTT
951 AAAAGAAGCA GGCGAAATGA AAAGAAGACT AAAGAAAGAG ATTTCAAAAT

1001 TTGTTGATTC TTCTGTAACC GGAATTAACA ACAAGAATAT TAGCAACGAA

1051 AAAGAAGAAG AGCTATCACA ATCCTGATTC TTAAAGATTT CAAAAATTCC 

1101 AGGTAAGAGA GATACATTCA TTAAAATTCA TATATTATAG TTTTTCATTT 

1151 CACAGCTGTT ATTTTCTTTT ATCTTAACAA TATTTTTTGA TTAGCTGGAA 

1201 GTAAAAAGTA TCAAATAAGA GAAGCGCTAG ACTGAGGTAA CTTAGCTTAT
1251 TCACATTCAT AGATCGACCT TCATATATCC AATACGATGA TAAGGAAACA

1301 GCAGTCATCC GTTTTAAAAA TAGTGCTATG AGGACTAAAT TTTTAGAGTC

1351 AAGAAATGGA GCCGAAATCT TAATCAAAAA GAATTGCGTC GATATTGCAA

1401 AAGAATCGAA CTCTAAATCT TTCGTTAATA AGTATTACCA ATCTTGATTG
1451 ATTGAAGAGA TTGACGAGGC AACTGCACAG AAGATCATTA AAGAAATAAA
1501 GTAACTTTTA TTAATTAGAG AATAAACTAA ATTACTAATA TAGAGATCAG 
1551 CGATCTTCAA TTGACGAAAT AAAAGCTGAA CTAAAGTTAG ACAATAAAAA 
1601 ATACAAACCT TGGTCAAAAT ATTGAGGAAG GAAAAGAAGA CCAGTTAGCA 
1651 AAAGAAAAAA TAAGGCAATA AATAAAATGA GTACAGAAGT GAAGAAATAA 
1701 AAGATTTATT TTTTTCAATA ATTTATTGAA AAGAGGGGTT TTGGGGTTTT
1751 GGGGTTTTGG GG 
1 AAAACCCCAA AACCCCAAAA CCCCTTTTAG AGCCCTGCAG TTGGAAATAT

51 AACCTCAGTA TTAATAAGCT CAGATTTTAA ATATTAATTA CAAAACCTAA 

101 ATGGAGGTTG ATGTTGATAA TCAAGCTGAT AATCATGGCA TTCACTCAGC

151 TCTTAAGACT TGTGAAGAAA TTAAAGAAGC TAAAACGTTG TACTCTTGGA

201 TCCAGAAAGT TATTAGATGA AGAAATCAAT CTCAAAGTCA TTATAAAGAT

251 TTAGAAGATA TTAAAATATT TGCGCAGACA AATATTGTTG CTACTCCACG

301 AGACTATAAT GAAGAAGATT TTAAAGTTAT TGCAAGAAAA GAAGTATTTT
351 CAACTGGACT AATGATCGAA CTTATTGACA AATGCTTAGT TGAACTTCTT

401 TCATCAAGCG ATGTTTCAGA TAGACAAAAA CTTCAATGAT TTGGATTTCA
451 ACTTAAGGGA AATCAATTAG CAAAGACCCA TTTATTAACA GCTCTTTCAA 
501 CTCAAAAGCA GTATTTCTTT CAAGACGAAT GGAACCAAGT TAGAGCAATG 
551 ATTGGAAATG AGCTCTTCCG ACATCTCTAC ACTAAATATT TAATATTCCA 

601 GCGAACTTCT GAAGGAACTC TTGTTCAATT TTGCGGGAAT AACGTTTTTG
651 ATCATTTGAA AGTCAACGAT AAGTTTGACA AAAAGCAAAA AGGTGGAGCA

701 GCAGACATGA ATGAACCTCG ATGTTGATCA ACCTGCAAAT ACAATGTCAA
751 GAATGAGAAA GATCACTTTC TCAACAACAT CAACGTGCCG AATTGGAATA

801 ATATGAAATC AAGAACCAGA ATATTTTATT GCACTCATTT TAATAGAAAT
851 AACCAATTCT TCAAAAAGCA TGAGTTTGTG AGTAACAAAA ACAATATTTC
901 AGCGATGGAC AGAGCTCAGA CGATATTCAC GAATATATTC AGATTTAATA
951 GAATTAGAAA GAAGCTAAAA GATAAGGTTA TCGAAAAAAT TGCCTACATG

1001 CTTGAGAAAG TCAAAGATTT TAACTTCAAC TACTATTTAA CAAAATCTTG 

1051 TCCTCTTCCA GAAAATTGGC GGGAACGGAA ACAAAAAATC GAAAACTTGA 

1101 TAAATAAAAC TAGAGAAGAA AAGTCGAAGT ACTATGAAGA GCTGTTTAGC

1151 TACACAACTG ATAATAAATG CGTCACACAA TTTATTAATG AATTTTTCTA 

1201 CAATATACTC CCCAAAGACT TTTTGACTGG AAGAAACCGT AAGAATTTTC
1251 AAAAGAAAGT TAAGAAATAT GTGGAACTAA ACAAGCATGA ACTCATTCAC

1301 AAAAACTTAT TGCTTGAGAA GATCAATACA AGAGAAATAT CATGGATGCA

1351 GGTTGAGACC TCTGCAAAGC ATTTTTATTA TTTTGATCAC GAAAACATCT

1401 ACGTCTTATG GAAATTGCTC CGATGGATAT TCGAGGATCT CGTCGTCTCG

1451 CTGATTAGAT GATTTTTCTA TGTCACCGAG CAACAGAAAA GTTACTCCAA

1501 AACCTATTAC TACAGAAAGA ATATTTGGGA CGTCATTATG AAAATGTCAA
1551 TCGCAGACTT AAAGAAGGAA ACGCTTGCTG AGGTCCAAGA AAAAGAGGTT 

1601 GAAGAATGGA AAAAGTCGCT TGGATTTGCA CCTGGAAAAC TCAGACTAAT
1651 ACCGAAGAAA ACTACTTTCC GTCCAATTAT GACTTTCAAT AAGAAGATTG
1701 TAAATTCAGA CCGGAAGACT ACAAAATTAA CTACAAATAC GAAGTTATTG
1751 AACTCTCACT TAATGCTTAA GACATTGAAG AATAGAATGT TTAAAGATCC
1801 TTTTGGATTC GCTGTTTTTA ACTATGATGA TGTAATGAAA AAGTATGAGG
1851 AGTTTGTTTG CAAATGGAAG CAAGTTGGAC AACCAAAACT CTTCTTTGCA
1901 ACTATGGATA TCGAAAAGTG ATATGATAGT GTAAACAGAG AAAAACTATC 
1951 AACATTCCTA AAAACTACTA AATTACTTTC TTCAGATTTC TGGATTATGA 
2001 CTGCACAAAT TCTAAAGAGA AAGAATAACA TAGTTATCGA TTCGAAAAAC
2051 TTTAGAAAGA AAGAAATGAA AGATTATTTT AGACAGAAAT TCCAGAAGAT
2101 TGCACTTGAA GGAGGACAAT ATCCAACCTT ATTCAGTGTT CTTGAAAATG 
2151 AACAAAATGA CTTAAATGCA AAGAAAACAT TAATTGTTGA AGCAAAGCAA 
2201 AGAAATTATT TTAAGAAAGA TAACTTACTT CAACCAGTCA TTAATATTTG

2251 CCAATATAAT TACATTAACT TTAATGGGAA GTTTTATAAA CAAACAAAAG

2301 GAATTCCTCA AGGTCTTTGA GTTTCATCAA TTTTGTCATC ATTTTATTAT
2351 GCAACATTAG AGGAAAGCTC CTTAGGATTC CTTAGAGATG AATCAATGAA
2401 CCCTGAAAAT CCAAATGTTA ATCTTCTAAT GAGACTTACA GATGACTATC
2451 TTTTGATTAC AACTCAAGAG AATAATGCAG TATTGTTTAT TGAGAAACTT
2501 ATAAACGTAA GTCGTGAAAA TGGATTTAAA TTCAATATGA AGAAACTACA
2551 GACTAGTTTT CCATTAAGTC CAAGCAAATT TGCAAAATAC GGAATGGATA
2601 GTGTTGAGGA GCAAAATATT GTTCAAGATT ACTGCGATTG GATTGGCATC
2651 TCAATTGATA TGAAAACTCT TGCTTTAATG CCAAATATTA ACTTGAGAAT
2701 AGAAGGAATT CTGTGTACAC TCAATCTAAA CATGCAAACA AAGAAAGCAT
2751 CAATGTGGCT CAAGAAGAAA CTAAAGTCGT TTTTAATGAA TAACATTACC
2801 CATTATTTTA GAAAGACGAT TACAACCGAA GACTTTGCGA ATAAAACTCT
2851 CAACAAGTTA TTTATATCAG GCGGTTACAA ATACATGCAA TGAGCCAAAG
2901 AATACAAGGA CCACTTTAAG AAGAACTTAG CTATGAGCAG TATGATCGAC
2951 TTAGAGGTAT CTAAAATTAT ATACTCTGTA ACCAGAGCAT TCTTTAAATA
3001 CCTTGTGTGC AATATTAAGG ATACAATTTT TGGAGAGGAG CATTATCCAG
3051 ACTTTTTCCT TAGCACACTG AAGCACTTTA TTGAAATATT CAGCACAAAA
3101 AAGTACATTT TCAACAGAGT TTGCATGATC CTCAAGGCAA AAGAAGCAAA
3151 GCTAAAAAGT GACCAATGTC AATCTCTAAT TCAATATGAT GCATAGTCGA
3201 CTATTCTAAC TTATTTTGGA AAGTTAATTT TCAATTTTTG TCTTATATAC
3251 TGGGGTTTTG GGGTTTTGGG GTTTTGGGG
1 MEVDVDNQAD NHGIHSALKT CEEIKEAKTL YSWIQKVIRC RNQSQSHYKD

51 LEDIKIFAQT NIVATPRDYN EEDFKVIARK EVFSTGLMIE LIDKCLVELL 

101 SSSDVSDRQK LQCFGFQLKG NQLAKTHLLT ALSTQKQYFF QDEWNQVRAM 

151 IGNELFRHLY TKYLIFQRTS EGTLVQFCGN NVFDHLKVND KFDKKQKGGA 

201 ADMNEPRCCS TCKYNVKNEK DHFLNNINVP NWNNMKSRTR IFYCTHFNRN 

251 NQFFKKHEFV SNKNNISAMD RAQTIFTNIF RFNRIRKKLK DKVIEKIAYM

301 LEKVKDFNFN YYLTKSCPLP ENWRERKQKI ENLINKTREE KSKYYEELFS

351 YTTDNKCVTQ FINEFFYNIL PKDFLTGRNR KNFQKKVKKY VELNKHELIH 

401 KNLLLEKINT REISWMQVET SAKHFYYFDH ENIYVLWKLL RWIFEDLVVS

451 LIRCFFYVTE QQKSYSKTYY YRKNIWDVIM KMSIADLKKE TLAEVQEKEV 

501 EEWKKSLGFA PGKLRLIPKK TTFRPIMTFN KKIVNSDRKT TKLTTNTKLL 

551 NSHLMLKTLK NRMFKDPFGF AVFNYDDVMK KYEEFVCKWK QVGQPKLFFA 

601 TMDIEKCYDS VNREKLSTFL KTTKLLSSDF WIMTAQILKR KNNIVIDSKN

651 FRKKEMKDYF RQKFQKIALE GGQYPTLFSV LENEQNDLNA KKTLIVEAKQ 

701 RNYFKKDNLL QPVINICQYN YINFNGKFYK QTKGIPQGLC VSSILSSFYY 

751 ATLEESSLGF LRDESMNPEN PNVNLLMRLT DDYLLITTQE NNAVLFIEKL 

801 INVSRENGFK FNMKKLQTSF PLSPSKFAKY GMDSVEEQNI VQDYCDWIGI 

851 SIDMKTLALM PNINLRIEGI LCTLNLNMQT KKASMWLKKK LKSFLMNNIT 

901 HYFRKTITTE DFANKTLNKL FISGGYKYMQ CAKEYKDHFK KNLAMSSMID

951 LEVSKIIYSV TRAFFKYLVC NIKDTIFGEE HYPDFFLSTL KHFIEIFSTK 

1001 KYIFNRVCMI LKAKEAKLKS DQCQSLIQYD A
CCCCAAAACCCCAAAACCCCAAAACCCCTATAAAAAAAGAAAAAATTGAGGTAGTTTAGA

1 + + + + + + 60 

GGGGTTTTGGGGTTTTGGGGTTTTGGGGATATTTTTTTCTTTTTTAACTCCATCAAATCT 

PQNPKTPKPL*KKKKLR*FR 
PKTPKPQNPYKKRKN*GSLE-
PKPQNPKTPIKKEKIEVV*K-

AATAAAATATTATTCCCGCACAAATGGAGATGGATATTGATTTGGATGATATAGAAAATT

61 + + + + + + 120 

TTATTTTATAATAAGGGCGTGTTTACCTCTACCTATAACTAAACCTACTATATCTTTTAA 

NKILFPHKWRWILIWMI*KI 
IKYYSRTNGDGY*FG*YRKF-
*NIIPAQMEMDIDLDDIENL-

TACTTCCTAATACATTCAACAAGTATAGCAGCTCTTGTAGTGACAAGAAAGGATGCAAAA 

121 + + + + + + 180 

ATGAAGGATTATGTAAGTTGTTCATATCGTCGAGAACATCACTGTTCTTTCCTACGTTTT 

YFLIHSTSIAALVVTRKDAK
TS*YIQQV*QLL**QERMQN-
LPNTFNKYSSSCSDKKGCKT- 

CATTGAAATCTGGCTCGAAATCGCCTTCATTGACTATTCCAAAGTTGCAAAAACAATTAG 

181 + + + + + + 240

GTAACTTTAGACCGAGCTTTAGCGGAAGTAACTGATAAGGTTTCAACGTTTTTGTTAATC 

H*NLARNRLH*LFQSCKNN* 
IEIWLEIAFIDYSKVAKTIR- 
LKSGSKSPSLTIPKLQKQLE-

AGTTCTACTTCTCGGATGCAAATCTTTATAACGATTCTTTCTTGAGAAAATTAGTTTTAA 

241 + + + + + + 300 

TCAAGATGAAGAGCCTACGTTTAGAAATATTGCTAAGAAAGAACTCTTTTAATCAAAATT 

SSTSRMQIFITILS*EN*F*
VLLLGCKSL*RFFLEKISFK-
FYFSDANLYNDSFLRKLVLK-

AAAGCGGAGAGCAAAGAGTAGAAATTGAAACATTACTAATGTTTAAATAAAATCAGGTAA 

301 + + + + + + 360 

TTTCGCCTCTCGTTTCTCATCTTTAACTTTGTAATGATTACAAATTTATTTTAGTCCATT 

a KAESKE*KLKHY*CLNKIR*

KRRAKSRN*NITNV*IKSGN-
SGEQRVEIETLLMFK*NQVM- 

TGAGGATTATTCTATTTTTTAGATCACTTCTTAAGGAGCATTATGGAGAAAATTACTTAA 

361 + + + + + + 420

ACTCCTAATAAGATAAAAAATCTAGTGAAGAATTCCTCGTAATACCTCTTTTAATGAATT 

a *GLFYFLDHFLRSIMEKIT*

EDYSIF*ITS*GALWRKLLN-
RIILFFRSLLKEHYGENYLI-
TACTAAAAGGTAAACAGTTTGGATTATTTCCCTAGCCAACAATGATGAGTATATTAAATT

421 + + + + + + 480 

ATGATTTTCCATTTGTCAAACCTAATAAAGGGATCGGTTGTTACTACTCATATAATTTAA 

a Y*KVNSLDYFPSQQ**VY*I 

b TKR*TVWIISLANNDEYIKF-

c LKGKQFGLFP*PTMMSILNS-

CATATGAGAATGAGTCAAAGGATCTCGATACATCAGACTTACCAAAGACAAACTCGCTAT 

481 + + + + + + 540 

GTATACTCTTACTCAGTTTCCTAGAGCTATGTAGTCTGAATGGTTTCTGTTTGAGCGATA 

a HMRMSQRISIHQTYQRQTRY

b I*E*VKGSRYIRLTKDKLAI 

c YENESKDLDTSDLPKTNSL*-

AAAACGCAAGAAAAAGTTTGATAATCGAACAGCAGAAGAACTTATTGCATTTACTATTCG 

541 + + + + + + 600 

TTTTGCGTTCTTTTTCAAACTATTAGCTTGTCGTCTTCTTGAATAACGTAAATGATAAGC 

a KTQEKV**SNSRRTYCIYYS

b KRKKKFDNRTAEELIAFTIR 

c NARKSLIIEQQKNLLHLLFV-

TATGGGTTTTATTACAATTGTTTTAGGTATCGACGGTGAACTCCCGAGTCTTGAGACAAT 

601 + + + + + + 660 

ATACCCAAAATAATGTTAACAAAATCCATAGCTGCCACTTGAGGGCTCAGAACTCTGTTA 

a YGFYYNCFRYRR*TPES*DN 

b MGFITIVLGIDGELPSLETI 

c WVLLQLF*VSTVNSRVLRQL-

TGAAAAAGCTGTTTACAACTGAAGGAATCGCAGTTCTGAAAGTTCTGATGTGTATGCCAT 

661 + + + + + + 720 

ACTTTTTCGACAAATGTTGACTTCCTTAGCGTCAAGACTTTCAAGACTACACATACGGTA 

a *KSCLQLKESQF*KF*CVCH 

b EKAVYN*RNRSSESSDVYAI 

c KKLFTTEGIAVLKVLMCMPL-

TATTTTGTGAATTAATCTCAAATATCTTATCTCAATTTAATGGATAGCTATAGAAACAAA 

721 + + + + + + 780 

ATAAAACACTTAATTAGAGTTTATAGAATAGAGTTAAATTACCTATCGATATCTTTGTTT 

a YFVN*SQISYLNLMDSYRNK

b IL*INLKYLISI*WIAIETN- 

c FCELISNILSQFNG*L*KQT-

CCAAATAAACCATGCAAGTTTAATGGAATATACGTTAAATCCTTTGGGACAAATGCACAC 

781 + + + + + + 840 

GGTTTATTTGGTACGTTCAAATTACCTTATATGCAATTTAGGAAACCCTGTTTACGTGTG 

a PNKPCKFNGIYVKSFGTNAH 

b QINHASLMEYTLNPLGQMHT 

c K*TMQV*WNIR*ILWDKCTL-

TGAATTTATATTGGATTCTTAAAGCATAGATACACAGAATGCTTTAGAGACTGATTTAGC 

841 + + + + + + 900 

ACTTAAATATAACCTAAGAATTTCGTATCTATGTGTCTTACGAAATCTCTGACTAAATCG

a *IYIGFLKHRYTECFRD*FS

b EFILDS*SIDTQNALETDLA-

c NLYWILKA*IHRML*RLI*L-
TTACAACAGATTACCTGTTTTGATTACTCTTGCTCATCTCTTATATCTTTAAAAGAAGCA

901 + + + + + + 960 

AATGTTGTCTAATGGACAAAACTAATGAGAACGAGTAGAGAATATAGAAATTTTCTTCGT 

a LQQITCFDYSCSSLISLKEA

b YNRLPVLITLAHLLYL*KKQ

c TTDYLF*LLLLISYIFKRSR-

GGCGAAATGAAAAGAAGACTAAAGAAAGAGATTTCAAAATTTGTTGATTCTTCTGTAACC 

961 + + + + + + 1020 

CCGCTTTACTTTTCTTCTGATTTCTTTCTCTAAAGTTTTAAACAACTAAGAAGACATTGG 

a GEMKRRLKKEISKFVDSSVT

b AK*KED*RKRFQNLLILL*P-

c RNEKKTKERDFKIC*FFCNR-

GGAATTAACAACAAGAATATTAGCAACGAAAAAGAAGAAGAGCTATCACAATCCTGATTC

1021 + + + + + + 1080 

CCTTAATTGTTGTTCTTATAATCGTTGCTTTTTCTTCTTCTCGATAGTGTTAGGACTAAG 

a GINNKNISNEKEEELSQS* F 

b ELTTRILATKKKKSYHNPDS- 

c N*QQEY*QRKRRRAITILIL-

TTAAAGATTTCAAAAATTCCAGGTAAGAGAGATACATTCATTAAAATTCATATATTATAG 

1081 + + + + + + 1140

AATTTCTAAAGTTTTTAAGGTCCATTCTCTCTATGTAAGTAATTTTAAGTATATAATATC 

a LKISKIPGKRDTFIKIHIL*

b *RFQKFQVREIHSLKFIYYS 

c KDFKNSR*ERYIH*NSYIIV-

TTTTTCATTTCACAGCTGTTATTTTCTTTTATCTTAACAATATTTTTTGATTAGCTGGAA 

1141 + + + + + + 1200 

AAAAAGTAAAGTGTCGACAATAAAAGAAAATAGAATTGTTATAAAAAACTAATCGACCTT 

a FFISQLLFSFILTIFFD*LE 

b FSFHSCYFLLS*QYFLISWK- 

c FHFTAVIFFYLNNIF*LAGS-

GTAAAAAGTATCAAATAAGAGAAGCGCTAGACTGAGGTAACTTAGCTTATTCACATTCAT 

1201 + + + + + + 1260 

CATTTTTCATAGTTTATTCTCTTCGCGATCTGACTCCATTGAATCGAATAAGTGTAAGTA 

a VKSIK*EKR*TEVT*LIHIH

b *KVSNKRSARLR*LSLFTFI 

c KKYQIREALD*GNLAYSHS*- 

AGATCGACCTTCATATATCCAATACGATGATAAGGAAACAGCAGTCATCCGTTTTAAAAA 

1261 + + + + + + 1320 

TCTAGCTGGAAGTATATAGGTTATGCTACTATTCCTTTGTCGTCAGTAGGCAAAATTTTT 

a RSTFIYPIR**GNSSHPF*K 

b DRPSYIQYDDKETAVIRFKN- 

c IDLHISNTMIRKQQSSVLKI-

TAGTGCTATGAGGACTAAATTTTTAGAGTCAAGAAATGGAGCCGAAATCTTAATCAAAAA 

1321 + + + + + + 1380 

ATCACGATACTCCTGATTTAAAAATCTCAGTTCTTTACCTCGGCTTTAGAATTAGTTTTT 

a *CYED*IFRVKKWSRNLNQK

b SAMRTKFLESRNGAEILIKK-

c VL*GLNF*SQEMEPKS*SKR-
GAATTGCGTCGATATTGCAAAAGAATCGAACTCTAAATCTTTCGTTAATAAGTATTACCA

1381 + + + + + + 1440

CTTAACGCAGCTATAACGTTTTCTTAGCTTGAGATTTAGAAAGCAATTATTCATAATGGT 

a ELRRYCKRIEL*IFR**VLP

b NCVDIAKESNSKSFVNKYYQ- 

c IASILQKNRTLNLSLISITN- 

ATCTTGATTGATTGAAGAGATTGACGAGGCAACTGCACAGAAGATCATTAAAGAAATAAA

1441 + + + + + + 1500 

TAGAACTAACTAACTTCTCTAACTGCTCCGTTGACGTGTCTTCTAGTAATTTCTTTATTT 

a ILIDCRD*RGNCTEDH*RNK

b S*LIEEIDEATAQKIIKEIK-

c LD*LKRLTRQLHRRSLKK*S-

GTAACTTTTATTAATTAGAGAATAAACTAAATTACTAATATAGAGATCAGCGATCTTCAA 

1501 + + + + + + 1560 

CATTGAAAATAATTAATCTCTTATTTGATTTAATGATTATATCTCTAGTCGCTAGAAGTT 

a VTFIN*RIN*ITNIEISDLQ

b *LLLIRE*TKLLI*RSAIFN- 

c NFY*LENKLNY*YRDQRSSI-

TTGACGAAATAAAAGCTGAACTAAAGTTAGACAATAAAAAATACAAACCTTGGTCAAAAT 

1561 + + + + + + 1620 

AACTGCTTTATTTTCGACTTGATTTCAATCTGTTATTTTTTATGTTTGGAACCAGTTTTA 

a LTK*KLN*S*TIKNTNLGQN 

b *RNKS*TKVRQ*KIQTLVKI 

c DEIKAELKLDNKKYKPWSKY- 

ATTGAGGAAGGAAAAGAAGACCAGTTAGCAAAAGAAAAAATAAGGCAATAAATAAAATGA 

1621 + + + + + + 1680 

TAACTCCTTCCTTTTCTTCTGGTCAATCGTTTTCTTTTTTATTCCGTTATTTATTTTACT 

a IEEGKEDQLAKEKIRQ*IK*

b LRKEKKTS*QKKK*GNK*NE 

c *GRKRRPVSKRKNKAINKMS-

GTACAGAAGTGAAGAAATAAAAGATTTATTTTTTTCAATAATTTATTGAAAAGAGGGGTT 

1681 + + + + + + 1740 

CATGTCTTCACTTCTTTATTTTCTAAATAAAAAAAGTTATTAAATAACTTTTCTCCCCAA 

a VQK*RNKRFIFFNNLLKRGV 

b YRSEEIKDLFFSIIY*KEGF

c TEVKK*KIYFFQ*FIEKRGF-

TTGGGGTTTTGGGGTTTTGGGG 

1741 + + 1762 

AACCCCAAAACCCCAAAACCCC 

a LGFWGFG 
b WGFGVLG 
c GVLGFW 
2 EVDVQNQADNHGIHSALKTCEEIKEAKTLYSWIQKVIRCRNQSQSHYKDL 51
j::: | :.::|: :| |.::| ::. . | |..|.|.| 

19 ELELEMQENQNDIQVRVK. . . . IDDPKQY . . LVNVTAACLLQEGSYYQDK 62 

52 EDIKIFAQTNIVATPRDYNEEDFKVIARKEVF . STGLMIELIDKCLVELL 100 

: .|.| ..|.. .|: ..:| | ...::|.: 
MEIENNQAQQPKAEKLWWELELEMQENQNDIQVRVKIDDPKQYL
VNVTAACLLQEGSYYQDKDERRYIITKALLEVAESDPEFICQLA 
VYIRNELYIRTTTNYIVAFCWHKNTQPFIEKYFNKAVLLPNDL 
LEVCEFAQVLYIFDATEFKNLYLDRILSQDIRKELTFRKCLQRC 
VRSKFSEFNEYQLGKYCTESQRKKTMFRYLSVTNKQKWDQTKKK 
RKENLLTKLQAIKESEDKSKRETGDIMNVEDAIKALKPAVMKKI 
AKRQNAMKKHMKAPKIPNSTLESKYLTFKDLIKFCHISEPKERV 
YKILGKKYPKTEEEYKAAFGDSASAPFNPELAGKRMKIEISKTW
ENELSAKGNTAEVWDNLISSNQLPYMAMLRNLSNILKAGVSDTT
HSIVINKICEPKAVENSKMFPLQFFSAIEAVNEAVTKGFKAKKR
ENMNLKGQIEAVKEWEKTDEEKKDMELEQTEEGEFVKVNEGIG
KQYINSIELAIKIAVNKNLDEIKGHTAIFSDVSGSMSTSMSGGA
KKYGSVRTCLECALVLGLMVKQRCEKSSFYIFSSPSSQCNKCYL
EVDLPGDELRPSMQKLLQEKGKLGGGTDFPYECIDEWTKNKTHV 
DNIVILSDMMIAEGYSDINVRGSSIVNSIKKYKDEVNPNIKIFA 
VDLEGYGKCLNLGDEFNENNYIKIFGMSDSILKFISAKQGGANM
VEVIKNFALQKIGQK

FIG. 20 



MSRRNQKKPQAPIGNETNLDFVLQNLEVYKSQIEHYKTQQQQIK
EEDLKLLKFKNQDQDGNSGNDDDDEENNSNKQQELLRRVNQIKQ 
QVQLIKKVGSKVEKDLNLNEDENKKNGLSEQQVKEEQLRTITEE 
QVKYQNLVFNMDYQLDLNESGGHRRHRRETDYDTEKWFEISHDQ 
KNYVSIYANQKTSYCWWLKDYFNKNNYDHLNVSINRLETEAEFY
AFDDFSQTIKLTNNSYQTVNIDVNFDNNLCILALLRFLLSLERF
NILNIRSSYTRNQYNFEKIGELLETIFAWFSHRHLQGIHLQVP 
CEAFQYLVNSSSQISVKDSQLQVYSFSTDLKLVDTNKVQDYFKF
LQEFPRLTHVSQQAIPVSATNAVENLNVLLKKVKHANLNLVSIP
TQFNFDFYFVNLQHLKLEFGLEPNILTKQKLENLLLSIKQSKNL 
KFLRLNFYTYVAQETSRKQILKQATTIKNLKNNKNQEETPETKD 
ETPSESTSGMKFFDHLSELTELEDFSVNLQATQEIYDSLHKLLI 
RSTNLKKFKLSYKYEMEKSKMDTFIDLKNIYETLNNLKRCSVNI 
SNPHGNISYELTNKDSTFYKFKLTLNQELQHAKYTFKQNEFQFN 
NVKSAKIESSSLESLEDIDSLCKSIASCKNLQNVNIIASLLYPN
NIQKNPFNKPNLLFFKQFEQLKNLENVSINCILDQHILNSISEF
LEKNKKIKAFILKRYYLLQYYLDYTKLFKTLQQLPELNQVYINQ 
QLEELTVSEVHKQVWENHKQKAFYEPLCEFIKESSQTLQLIDFD 
QNTVSDDSIKKILESISESKYHHYLRLNPSQSSSLIKSENEEIQ 
ELLKACDEKGVLVKAYYKFPLCLPTGTYYDYNSDRW 



FIG. 22 



MKILFEFIQDKLDIDLQTNSTYKENLKCGHFNGLDEILTTCFAL 
PNSRKIALPCLPGDLSHKAVIDHCIIYLLTGELYNNVLTFGYKI 
ARNEDVNNSLFCHSANVNVTLLKGAAWKMFHSLVGTYAFVDLLI 
NYTVIQFNGQFFTQIVGNRCNEPHLPPKWVQRSSSSSATAAQIK
QLTEPVTNKQFLHKLNINSSSFFPYSKILPSSSSIKKLTDLREA
IFPTNLVKIPQRLKVRINLTLQKLLKRHKRLNYVSILNSICPPL
EGTVLDLSHLSRQSPKERVLKFIIVILQKLLPQEMFGSKKNKGK 
IIKNLNLLLSLPLNGYLPFDSLLKKLRLKDFRWLFISDIWFTKH 
NFENLNQLAICFISWLFRQLIPKIIQTFFYCTEISSTVTIVYFR 
HDTWNKLITPFIVEYFKTYLVENNVCRNHNSYTLSNFNHSKMRI 
IPKKSNNEFRIIAIPCRGADEEEFTIYKENHKNAIQPTQKILEY
LRNKRPTSFTKIYSPTQIADRIKEFKQRLLKKFNNVLPELYFMK 
FDVKSCYDSIPRMECMRILKDALKNENGFFVRSQYFFNTNTGVL
KLFNVVNASRVPKPYELYIDNVRTVHLSNQDVINVVEMEIFKTA
LWVEDKCYIREDGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPS 
QDTLILKLADDFLIISTDQQQVINIKKLAMGGFQKYNAKANRDK 
ILAVSSQSDDDTVIQFCAMHIFVKELEVWKHSSTMNNFHIRSKS 
SKGIFRSLIALFNTRISYKTIDTNLNSTNTVLMQIDHVVKNISE
CYKSAFKDLSINVTQNMQFHSFLQRIIEMTVSGCPITKCDPLIE
YEVRFTILNGFLESLSSNTSKFKDNIILLRKEIQHLQAYIYIYI
HIVN 

FIG. 23 
1 tcaatactat taattaataa ataaaaaaaa gcaaactaca aagaaaatgt caaggcgtaa

61 ctaaaaaaag ccataggctc ctataggcaa tgaaacaaat cttgattttg tattacaaaa 

121 tctagaagtt tacaaaagcc agattgagca ttataagacc tagtagtaat agatcaaaga 

181 ggaggatctc aagcttttaa agttcaaaaa ttaagattag gatggaaact ctggcaacga 

241 tgatgatgat gaagaaaaca actcaaataa ataataagaa ttattaagga gagtcaatta 

301 gattaagtag caagtttaat tgataaaaaa agttggttct aaggtagaga aagatttgaa

361 tttgaacgaa gatgaaaaca aaaagaatgg actttctgaa tagcaagtga aagaagagta

421 attaagaacg attactgaag aataggttaa gtattaaaat ttagtattta acatggacta 

481 ccagttagat ttaaatgaga gtggtggcca tagaagacac agaagagaaa cagattatga 

541 tactgaaaaa tggtttgaaa tatctcatga ccaaaaaaat tatgtatcaa tttacgccaa 

601 ctaaaagaca tcatattgtt ggtggcttaa agattatttt aataaaaaca attatgatca 

661 tcttaatgta agcattaaca gactagaaac tgaagccgaa ttctatgcct ttgatgattt 

721 ttcacaaaca atcaaactta ctaataattc ttactagact gttaacatag acgttaattt 

781 tgataataat ctctgtatac tcgcattgct tagattttta ttatcactag aaagattcaa 

841 tattttgaat ataagatctt cttatacaag aaattaatat aattttgaga aaattggtga 

901 gctacttgaa actatcttcg cagttgtctt ttctcatcgc cacttacaag gcattcattt 

961 acaagttcct tgcgaagcgt tctaatattt agttaactcc tcatcataaa ttagcgttaa 

1021 agatagctaa ttataggtat actctttctc tacagactta aaattagttg acactaacaa 

1081 agtccaagat tattttaagt tcttataaga attccctcgt ttgactcatg taagctagta 

1141 ggctatccca gttagtgcta ctaacgctgt agagaacctc aatgttttac ttaaaaaggt 

1201 caagcatgct aatcttaatt tagtttctat ccctacctaa ttcaattttg atttctactt 

1261 tgttaattta taacatttga aattagagtt tggattagaa ccaaatattt tgacaaaaca 

1321 aaagcttgaa aatctacttt tgagtataaa ataatcaaaa aatcttaaat ttttaagatt 

1381 aaacttttac acctacgttg cttaagaaac ctccagaaaa cagatattaa aacaagctac 

1441 aacaatcaaa aatctcaaaa acaataaaaa tcaagaagaa actcctgaaa ctaaagatga 

1501 aactccaagc gaaagcacaa gtggtatgaa attttttgat catctttctg aattaaccga 

1561 gcttgaagat ttcagcgtta acttgtaagc tacccaagaa atttatgata gcttgcacaa 

1621 acttttgatt agatcaacaa atttaaagaa gttcaaatta agttacaaat atgaaatgga 

1681 aaagagtaaa atggatacat tcatagatct taagaatatt tatgaaacct taaacaatct 

1741 taaaagatgc tctgttaata tatcaaatcc tcatggaaac atttcttatg aactgacaaa 

1801 taaagattct actttttata aatttaagct gaccttaaac taagaattat aacacgctaa 

1861 gtatactttt aagtagaacg aattttaatt taataacgtt aaaagtgcaa aaattgaatc 

1921 ttcctcatta gaaagcttag aagatattga tagtctttgc aaatctattg cttcttgtaa 

1981 aaatttacaa aatgttaata ttatcgccag tttgctctat cccaacaata tttagaaaaa 

2041 tcctttcaat aagcccaatc ttctattttt caagcaattt gaataattga aaaatttgga 

2101 aaatgtatct atcaactgta ttcttgatca gcatatactt aattctattt cagaattctt 

2161 agaaaagaat aaaaaaataa aagcattcat tttgaaaaga tattatttat tacaatatta 

2221 tcttgattat actaaattat ttaaaacact tcaatagtta cctgaattaa attaagttta 

2281 cattaattag caattagaag aattgactgt gagtgaagta cataagtaag tatgggaaaa 

2341 ccacaagcaa aaagctttct atgaaccatt atgtgagttt atcaaagaat catcctaaac 

2401 cctttagcta atagattttg accaaaacac tgtaagtgat gactctatta aaaagatttt 

2461 agaatctata tctgagtcta agtatcatca ttatttgaga ttgaacccta gttaatctag 

2521 cagtttaatt aaatctgaaa acgaagaaat ttaagaactt ctcaaagctt gcgacgaaaa 

2581 aggtgtttta gtaaaagcat actataaatt ccctctatgt ttaccaactg gtacttatta 

2641 cgattacaat tcagatagat ggtgattaat taaatattag tttaaataaa tattaaatat 

2701 tgaatatttc tttgcttatt atttgaataa tacatacaat agtcattttt agtgttttga 

2761 atatatttta gttatttaat tcattatttt aagtaaataa ttatttttca atcatttttt 

2821 aaaaaatcg 
Oxytricha LCVSYILSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT
Euplotes LCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLT 

FIG. 24 



ATTTATACTCATGAAAATCTTATTCGAGTTCATTCAAGACAAGCTTGACATTGATCTACA 
GACCAACAGTACTTACAAAGAAAATTTAAAATGTGGTCACTTCAATGGCCTCGATGAAAT 
TCTAACTACGTGTTTCGCACTACCAAATTCAAGAAAAATAGCATTACCATGCCTTCCTGG 
TGACTTAAGCCACAAAGCAGTCATTGATCACTGCATCATTTACCTGTTGACGGGCGAATT 
ATACAACAACGTACTAACATTTGGCTATAAAATAGCTAGAAATGAAGATGTCAACAATAG 
TCTTTTTTGCCATTCTGCAAATGTTAACGTTACGTTACTGAAAGGCGCTGCTTGGAAAAT 
GTTCCACAGTTTGGTCGGTACATACGCATTCGTTGATTTATTGATCAATTATACAGTAAT 
TCAATTTAATGGGCAGTTTTTCACTCAAATCGTGGGTAACAGATGTAACGAACCTCATCT 
GCCGCCCAAATGGGTCCAACGATCATCCTCATCATCCGCAACTGCTGCGCAAATCAAACA 
ACTTACAGAACCAGTGACAAATAAACAATTCTTACACAAGCTCAATATAAATTCCTCTTC 
TTTTTTTCCTTATAGCAAGATCCTTCCTTCATCATCATCTATCAAAAAGCTAACTGACTT 
GAGAGAAGCTATTTTTCCCACAAATTTGGTTAAAATTCCTCAGAGACTAAAGGTACGAAT 
TAATTTGACGCTGCAAAAGCTATTAAAGAGACATAAGCGTTTGAATTACGTTTCTATTTT 
GAATAGTATTTGCCCACCATTGGAAGGGACCGTATTGGACTTGTCGCATTTGAGTAGGCA 
ATCACCAAAGGAACGAGTCTTGAAATTTATCATTGTTATTTTACAGAAGTTATTACCCCA 
AGAAATGTTTGGCTCAAAGAAAAATAAAGGAAAAATTATCAAGAATCTAAATCTTTTATT 
AAGTTTACCCTTAAATGGCTATTTACCATTTGATAGTTTGTTGAAAAAGTTAAGATTAAA 
GGATTTTCGGTGGTTGTTCATTTCTGATATTTGGTTCACCAAGCACAATTTTGAAAACTT 
GAATCAATTGGCGATTTGTTTCATTTCCTGGCTATTTAGACAACTAATTCCCAAAATTAT 
ACAGACTTTTTTTTACTGCACCGAAATATCTTCTACAGTGACAATTGTTTACTTTAGACA 
TGATACTTGGAATAAACTTATCACCCCTTTTATCGTAGAATATTTTAAGACGTACTTAGT 
CGAAAACAACGTATGTAGAAACCATAATAGTTACACGTTGTCCAATTTCAATCATAGCAA 
AATGAGGATTATACCAAAAAAAAGTAATAATGAGTTCAGGATTATTGCCATCCCATGCAG 
AGGGGCAGACGAAGAAGAATTCACAATTTATAAGGAGAATCACAAAAATGCTATCCAGCC
CACTCAAAAAATTTTAGAATACCTAAGAAACAAAAGGCCGACTAGTTTTACTAAAATATA 
TTCTCCAACGCAAATAGCTGACCGTATCAAAGAATTTAAGCAGAGACTTTTAAAGAAATT 
TAATAATGTCTTACCAGAGCTTTATTTCATGAAATTTGATGTCAAATCTTGCTATGATTC 
CATACCAAGGATGGAATGTATGAGGATACTCAAGGATGCGCTAAAAAATGAAAATGGGTT 
TTTCGTTAGATCTCAATATTTCTTCAATACCAATACAGGTGTATTGAAGTTATTTAATGT 
TGTTAACGCTAGCAGAGTACCAAAACCTTATGAGCTATACATAGATAATGTGAGGACGGT 
TCATTTATCAAATCAGGATGTTATAAACGTTGTAGAGATGGAAATATTTAAAACAGCTTT 
GTGGGTTGAAGATAAGTGCTACATTAGAGAAGATGGTCTTTTTCAGGGCTCTAGTTTATC 
TGCTCCGATCGTTGATTTGGTGTATGACGATCTTCTGGAGTTTTATAGCGAGTTTAAAGC 
CAGTCCTAGCCAGGACACATTAATTTTAAAACTGGCTGACGATTTCCTTATAATATCAAC 
AGACCAACAGCAAGTGATCAATATCAAAAAGCTTGCCATGGGCGGATTTCAAAAATATAA
TGCGAAAGCCAATAGAGACAAAATTTTAGCCGTAAGCTCCCAATCAGATGATGATACGGT 
TATTCAATTTTGTGCAATGCACATATTTGTTAAAGAATTGGAAGTTTGGAAACATTCAAG 
CACAATGAATAATTTCCATATCCGTTCGAAATCTAGTAAAGGGATATTTCGAAGTTTAAT
AGCGCTGTTTAACACTAGAATCTCTTATAAAACAATTGACACAAATTTAAATTCAACAAA 
CACCGTTCTCATGCAAATTGATCATGTTGTAAAGAACATTTCGGAATGTTATAAATCTGC 
TTTTAAGGATCTATCAATTAATGTTACGCAAAATATGCAATTTCATTCGTTCTTACAACG 
CATCATTGAAATGACAGTCAGCGGTTGTCCAATTACGAAATGTGATCCTTTAATCGAGTA 
TGAGGTACGATTCACCATATTGAATGGATTTTTGGAAAGCCTATCTTCAAACACATCAAA
ATTTAAAGATAATATCATTCTTTTGAGAAAGGAAATTCAACACTTGCAAGC 
AKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKR
VQLRDVSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREK
RAERLTSRVKALFSVLNYERA 

FIG. 27 



GCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTC 
TTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACC 
GGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAAG 
AGGGTGCAGCTGCGGGACGTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGC 
CAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGACGGGC 
TGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGA 
GAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCT 
CAACTACGAGCGGGCGCG 



FIG. 28 



MTEHHTPKSRILRFLENQYVYLCTLNDYVQLVLRGSPASSYSNICERLRSDVQTSFSIFLHSTWGF
DSKPDEGVQFSSPKCSQSELIANWKQMFDESFERRRNLLMKGFSMNHEDFRAMHVNGVQNDLVSTF
PNYLISILESKNWQLLLEIIGSDAMHYLLSKGSIFEALPNDNYLQISGIPLFKNNVFEETVSKKRKR
TIETSITQNKSARKEVSWNSISISRFSIFYRSSYKKFKQDLYFNLHSICDRNTVHMWLQWIFPRQFG
LINAFQVKQLHKVIPLVSQSTWPK^LLKVYPLIEQTAKRLHRISLSKVYNHYCPYIDTHDDEKILS 
YSLKPNQVFAFLRSILVRVFPKLIWGNQRIFEIILKDLETFLKLSRYESFSLHYLMSNIKISEIEWL 
VLGKRSNAKMCLSDFEKRKQIFAEFIYWLYNSFIIPILQSFFYITESSDLRNRTVYFRKDIWKLLCR 
PFITSMKMEAFEKINENNVRMDTQKTTLPPAVI^ 

QTLRPVASILKHLINEESSGIPFNLEVYMKLLTFKKDLLKHRMFGRKKYFVRIDIKSCYDRIKQDLM 
FRIVKKKLKDPEFVIRKYATIHATSDRATKNFVSEAFSYFDMVPFEKVVQLLSMKTSDTLFVDFVDY 
WTKSSSEIFKMLKEHLSGHIVKIGNSQYLQKVGIPQGSILSSFLCHFYMEDLIDEYLSFTKKKGSVL 
LRVVDDFLFITVNKKDAKKFLNLSLRGFEKHNFSTSLEKTVINFENSNGIINNTFFNESKKRMPFFG 
FSVNMRSLDTLLACPKIDEALFNSTSVELTKHMGKSFFYKILRSSLASFAQVFIDITHNSKFNSCCN 
IYRLGYSMCMRAQAYLKRMKDIFIPQRMFITDLLNVIGRKIWKKLAEILGYTSRRFLSSAEVKWLFC
LGMRDGLKPSFKYHPCFEQLIYQFQSLTDLIKPLRPVLRQVLFLHRRIAD



FIG. 29 
S-1: FFY VTE TTF QKN RLF FYR KSV WSK
S-2: RQH LKR VQL RDV SEA EVR QHR EA
S-3: ART FRR EKR AER LTS RVK ALF SVL NYE



A-1: AKF LHW LMS VYV VEL LRS FFY VTE TTF Q
A-2: LFF YRK SVW SKL QSI GIR QHL KRV QLR DVS
A-3: PAL LTS RLR FIP KPD GLR PIV NMD YVV

FIG. 32 



Poly 4 

t t c 

ta a g c c teg 
5'- cag acc aaa gga att cca taa gg -3'
QTKGIPQG 

4(B') 



5(C) 

D D Y L L I T 
3'- ctg ctg atg gag gag tag tgg -5'
a a aaaaa a a 
t t t t 
c c 
Poly 1 
B2-
Vector
A5
5825 bp Sequenced
2 kb Hind III Fragment
Genomic DNA Insert
Vector
1 kb

FIG. 33A



tez1+
Introns 1 2 3
RT Motifs 1 2 3(A) 4(B') 5(C) 6(C)
4-1 cDNA
2-3 & 5-20 cDNA
5 6 7 8 9 10 11 12 13 14 15
Hind III Xca I Xca I
Original PCR
3' RT-PCR
5' RT-PCR W/M2-B14
5' RT-PCR W/M2-B15 Band A
5' RT-PCR W/M2-B15 Band B
5' RT-PCR W/M2-B16 Band C
Hind III
500 bp

FIG. 33B
Wild Type Telomerase Gene
Wild Type Telomerase Gene

1. Transform with linear fragment containing the telomerase gene
   disrupted with a LEU2 or ura4 marker

M
Telomerase Gene::
Wild Type Telomerase Gene

Wild Type Telomerase Gene
Wild Type (X2)

2. Assay in selective media
3. Sporulate, and grow on selective media

Senescence? (X2)
(These cells will show a senescence phenotype if the disrupted gene encodes a telomerase subunit.)

FIG. 43
heterozygote diploid
wt diploid
wt haploid
Tetrad
600 bp
500 bp
400 bp
300 bp
Telomeric Apa I fragment

FIG. 45
1

met ser val tyr val val glu leu leu 
GCCAAGTTCCTGCACTGGCTG ATG AGT GTG TAC GTC GTC GAG CTG CTC 

10 20 
arg ser phe phe tyr val thr glu thr thr phe gln lys asn arg
AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC AGG 

30 

leu phe phe tyr arg lys ser val trp ser lys leu gln ser ile
CTC TTT TTC TAC CGG AAG AGT GTC TGG AGC AAG TTG CAA AGC ATT 

40 50 
gly ile arg gln his leu lys arg val gln leu arg glu leu ser
GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG TCG 

60 

glu ala glu val arg gln his arg glu ala arg pro ala leu leu
GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG CTG 

70 80 
thr ser arg leu arg phe ile pro lys pro asp gly leu arg pro 
ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG CCG 

90 

ile val asn met asp tyr val val gly ala arg thr phe arg arg 
ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC AGA 

100 110 

glu lys ala glu arg leu thr ser arg val lys ala leu phe 

GAA AAG ARG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG TTC 

120 

ser val leu asn tyr glu arg ala arg arg pro gly leu leu gly 
AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG GGC 

130 140 

ala ser val leu gly leu asp asp ile his arg ala trp arg thr 
GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC ACC 

150 

phe val leu arg val arg ala gln asp pro pro pro glu leu tyr
TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG TAC 

160 170 

phe val lys val asp val thr gly ala tyr asp thr ile pro gln
TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC CAG 

180 

asp arg leu thr glu val ile ala ser ile ile lys pro gln asn
GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG AAC 

190 200 

thr tyr cys val arg arg tyr ala val val gln lys ala ala met
ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC ATG 
210

gly thr ser ala arg pro ser arg ala thr ser tyr val gln cys
GGC ACG TCC GCA AGG CCT TCA AGA GCC ACG TCC TAC GTC CAG TGC

220 230

gln gly ile pro gln gly ser ile leu ser thr leu leu cys ser
CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC ACG CTG CTC TGC AGC

240

leu cys tyr gly asp met glu asn lys leu phe ala gly ile arg
CTG TGC TAC GGC GAC ATG GAG AAC AAG CTG TTT GCG GGG ATT CGG

250 260

arg asp gly leu leu leu arg leu val asp asp phe leu leu val
CGG GAC GGG CTG CTC CTG CGT TTG GTG GAT GAT TTC TTG TTG GTG

270

thr pro his leu thr his ala lys thr phe leu arg thr leu val
ACA CCT CAC CTC ACC CAC GCG AAA ACC TTC CTC AGG ACC CTG GTC

280 290

arg gly val pro glu tyr gly cys val val asn leu arg lys thr
CGA GGT GTC CCT GAG TAT GGC TGC GTG GTG AAC TTG CGG AAG ACA

300

val val asn phe pro val glu asp glu ala leu gly gly thr ala
GTG GTG AAC TTC CCT GTA GAA GAC GAG GCC CTG GGT GGC ACG GCT

310 320

phe val gln met pro ala his gly leu phe pro trp cys gly leu
TTT GTT CAG ATG CCG GCC CAC GGC CTA TTC CCC TGG TGC GGC CTG

330

leu leu asp thr arg thr leu glu val gln ser asp tyr ser ser
CTG CTG GAT ACC CGG ACC CTG GAG GTG CAG AGC GAC TAC TCC AGC

340 350

tyr ala arg thr ser ile arg ala ser leu thr phe asn arg gly
TAT GCC CGG ACC TCC ATC AGA GCC AGT CTC ACC TTC AAC CGC GGC

360

phe lys ala gly arg asn met arg arg lys leu phe gly val leu
TTC AAG GCT GGG AGG AAC ATG CGT CGC AAA CTC TTT GGG GTC TTG

370 380

arg leu lys cys his ser leu phe leu asp leu gln val asn ser
CGG CTG AAG TGT CAC AGC CTG TTT CTG GAT TTG CAG GTG AAC AGC

390

leu gln thr val cys thr asn ile tyr lys ile leu leu leu gln
CTC CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG CTG CAG

400 410

ala tyr arg phe his ala cys val leu gln leu pro phe his gln
GCG TAC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT CAT CAG
420

gln val trp lys asn pro his phe ser cys ala ser ser leu thr
CAA GTT TGG AAG AAC CCA CAT TTT TCC TGC GCG TCA TCT CTG ACA 

430 440 

arg leu pro leu leu leu his pro glu ser gln glu arg arg asp
CGG CTC CCT CTG CTA CTC CAT CCT GAA AGC CAA GAA CGC AGG GAT 

450 

val ala gly gly gln gly arg arg arg pro ser ala leu arg gly
GTC GCT GGG GGC CAA GGG CGC CGC CGG CCC TCT GCC CTC CGA GGC 

460 470 

arg ala val ala val pro pro ser ile pro ala gln ala asp ser
CGT GCA GTG GCT GTG CCA CCA AGC ATT CCT GCT CAA GCT GAC TCG 

480 

thr pro cys his leu arg ala thr pro gly val thr gln asp ser
ACA CCG TGT CAC CTA CGT GCC ACT CCT GGG GTC ACT CAG GAC AGC 

490 500 

pro asp ala ala glu ser glu ala pro gly asp asp ala asp cys 
CCA GAC GCA GCT GAG TCG GAA GCT CCC GGG GAC GAC GCT GAC TGC 

510 

pro gly gly arg ser gln pro gly thr ala leu arg leu gln asp
CCT GGA GGC CGC AGC CAA CCC GGC ACT GCC CTC AGA CTT CAA GAC 

520 530 

his pro gly leu met ala thr arg pro gln pro gly arg glu gln
CAT CCT GGA CTG ATG GCC ACC CGC CCA CAG CCA GGC CGA GAG CAG 

540 

thr pro ala ala leu ser arg arg ala tyr thr ser gln gly gly
ACA CCA GCA GCC CTG TCA CGC CGG GCT TAT ACG TCC CAG GGA GGG 

550 560 

arg gly gly pro his pro gly leu his arg trp glu ser glu ala 
AGG GGC GGC CCA CAC CCA GGC CTG CAC CGC TGG GAG TCT GAG GCC 

564 
OP 

TGA GTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGC 

CTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCGTTTTCACTTCCCCAC 

AGGCTGGCGTTCGGTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACT 

CCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCTTCGCCCTGCCTTCC 

TTTGCCTTCCACCCCCACCATTCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTTTGGG 

AATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGG 

GTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTT 

TTTCAGTTTTGGAAAAAAAAAAAAAAAAAAAAAAAAAA 
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1 GCAGCGCTGC GTCCTGCTGC GCACGTGGGA AGCCCTGGCC CCGGCCACCC 

51 CCGCGATGCC GCGCGCTCCC CGCTGCCGAG CCGTGCGCTC CCTGCTGCGC 

101 AGCCACTACC GCGAGGTGCT GCCGCTGGCC ACGTTCGTGC GGCGCCTGGG 

151 GCCCCAGGGC TGGCGGCTGG TGCAGCGCGG GGACCCGGCG GCTTTCCGCG 

201 CGNTGGTGGC CCANTGCNTG GTGTGCGTGC CCTGGGANGN ANGGCNGCCC

251 CCCGCCGCCC CCTCCTTCCG CCAGGTGTCC TGCCTGAANG ANCTGGTGGC

301 CCGAGTGCTG CANANGCTGT GCGANCGCGG CGCGAANAAC GTGCTGGCCT

351 TCGGCTTCGC GCTGCTGGAC GGGGCCCGCG GGGGCCCCCC CGAGGCCTTC

401 ACCACCAGCG TGCGCAGCTA CCTGCCCAAC ACGGTGACCG ACGCACTGCG

451 GGGGAGCGGG GCGTGGGGGC TGCTGCTGCG CCGCGTGGGC GACGACGTGC

501 TGGTTCACCT GCTGGCACGC TGCGCGNTNT TTGTGCTGGT GGNTCCCAGC

551 TGCGCCTACC ANGTGTGCGG GCCGCCGCTG TACCAGCTCG GCGCTGCNAC

601 TCAGGCCCGG CCCCCGCCAC ACGCTANTGG ACCCGAANGC GTCTGGGATC

651 CAACGGGCCT GGAACCATAG CGTCAGGGAG GCCGGGGTCC CCCTGGGCTG

701 CCAGCCCCGG GTGCGAGGAG GCGCGGGGGC AGTGCCAGCC GAAGTCTGCC

751 GTTGCCCAAG AGGCCCAGGC GTGGCGCTGC CCCTGAGCCG GAGCGGACGC

801 CCGTTGGGCA GGGGTCCTGG GCCCACCCGG GCAGGACGCC TGGACCGAGT

851 GACCGTGGTT TCTGTGTGGT GTCACCTGCC AGACCCGCCG AAGAAGCCAC

901 CTCTTTGGAG GGTGCGCTCT CTGGCACGCG CCACTCCCAC CCATCCGTGG

951 GCCGCCAGCA CCACGCGGGC CCCCCATCCA CATCGCGGCC ACCACGTCCT

1001 GGGACACGCC TTGTCCCCCG GTGTACGCCG AGACCAAGCA CTTCCTCTAC

1051 TCCTCAGGCG ACAAGNACAC TGCGNCCCTC CTTCCTACTC AATATATCTG

1101 AGGCCCAGCC TGACTGGCGT TCGGGAGGTT CGTGGAGACA NTCTTTCTGG

1151 TTCCAGGCCT TGGATGCCAG GATTCCCCGC AGGTTGCCCC GCCTGCCCCA

1201 GCGNTACTGG CAAATGCGGC CCCTGTTTCT GGAGCTGCTT GGGAACCACG

1251 CGCAGTGCCC CTACGGGGTG TTCCTCAAGA CGCACTGCCC GCTGCGAGCT

1301 GCGGTCACCC CAGCAGCCGG TGTCTGTGCC CGGGAGAAGC CCCAGGGCTC

1351 TGTGGCGGCC CCCGAGGAGG AGGAACACAG ACCCCCGTCG CCTGGTGCAG

1401 CTGCTCCGCC AGCACAGCAG CCCCTGGCAG GTGTACGGCT TCGTGCGGGC

1451 CTGCCTGCGC CGGCTGGTGC CCCCAGGCCT CTGGGGCTCC AGGCACAACG

1501 AACGCCGCTT CCTCAGGAAC ACCAAGAAGT TCATCTCCCT GGGGAAGCAT

1551 GCCAAGCTCT CGCTGCAGGA GCTGACGTGG AAGATGAGCG TGCGGGACTG

1601 CGCTTGGCTG CGCAGGAGCC CAGGGGTTGG CTGTGTTCCG GCCGCAGAGC

1651 ACCGTCTGCG TGAGGAGATC CTGGCCAAGT TCCTGCACTG GCTGATGAGT

1701 GTGTACGTCG TCGAGCTGCT CAGGTCTTTC TTTTATGTCA CGGAGACCAC

1751 GTTTCAAAAG AACAGGCTCT TTTTCTACCG GAAGAGTGTC TGGAGCAAGT

1801 TGCAAAGCAT TGGAATCAGA CAGCACTTGA AGAGGGTGCA GCTGCGGGAG

1851 CTGTCGGAAG CAGAGGTCAG GCAGCATCGG GAAGCCAGGC CCGCCCTGCT

1901 GACGTCCAGA CTCCGCTTCA TCCCCAAGCC TGACGGGCTG CGGCCGATTG

1951 TGAACATGGA CTACGTCGTG GGAGCCAGAA CGTTCCGCAG AGAAAAGAGG

2001 GCCGAGCGTC TCACCTCGAG GGTGAAGGCA CTGTTCAGCG TGCTCAACTA

2051 CGAGCGGGCG CGGCGCCCCG GCCTCCTGGG CGCCTCTGTG CTGGGCCTGG

2101 ACGATATCCA CAGGGCCTGG CGCACCTTCG TGCTGCGTGT GCGGGCCCAG

2151 GACCCGCCGC CTGAGCTGTA CTTTGTCAAG GTGGATGTGA CGGGCGCGTA

2201 CGACACCATC CCCCAGGACA GGCTCACGGA GGTCATCGCC AGCATCATCA

2251 AACCCCAGAA CACGTACTGC GTGCGTCGGT ATGCCGTGGT CCAGAAGGCC

2301 GCCCATGGGC ACGTCCGCAA GGCCTTCAAG AGCCACGTCT CTACCTTGAC

2351 AGACCTCCAG CCGTACATGC GACAGTTCGT GGCTCACCTG CAGGANAACA

2401 GCCCGCTGAG GGATGCCGTC GTCATCGAGC AGAGCTCCTC CCTGAATGAG

2451 GCCAGCAGTG GCCTCTTCGA CGTCTTCCTA CGCTTCATGT GCCACCACGC
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2 5 01 CGTGCGCATC AGGGG CAAGT 

2 551 GCTCCATCCT CTCCACGCTG 

26 01 AACAAGCTGT TTGCGGGGAT 

26 51 GGATGATTTC TTGTTGGTGA 

2 7 01 TCAGGACCCT GGTCCGAGGT 

2 751 CGGAAGACAG TGGTGAACTT 

2 8 01 GGCTTTTGTT CAGATGCCGG 

2 851 TGCTGGATAC CCGGACCCTG 

2 901 CGGACCTCCA T C AG AG C C AG 

2 951 GAGGAACATG CGTCGCAAAC 

3 001 GCCTGTTTCT GGATTTGCAG 
3 0 51 ATCTACAAGA TCCTCCTGCT 
3101 GCAGCTCCCA TTTCATCAGC 
3151 GCGTCATCTC TGACACGGCC 
32 01 AACGCAGGGA TGT CGCTGGG 

32 51 CGAGGC CGTG CAGTGGCTGT 

33 01 GACACCGTGT CACCTACGTG 
3351 ACGCAGCTGA GTCGGAAGCT 

34 01 CGCAGCCAAC CCGGCACTGC 
34 51 GGCCACCCGC CCACAGCCAG 
3 5 01 GCCGGGCTCT ACGTCCCAGG 
3 551 ACCGCTGGGA GTCTGAGGCC 
3 6 01 CCGGCTGAAG GCTGAGTGTC 
3 6 51 GGGCTGAGTG TCCAGCACAC 
3 7 01 CTCGGCTCCA CCCCAGGGCC 
3 751 CTCCCCACAT AGGAATAGTC 
3 801 GCCCTGCCCT CCTTTGCCTT 
3 851 GAAGGACCCT GGGAGCTCTG 
3 901 TACACAGGCG AGGACCCTGC 
3 951 GGGGGGAGGT GCTGTGGGAG 
4001 TTGAAAAAAA AAAAAAAAAA 



CCTACGTCCA GTGCCAGGGG ATCCCGCAGG 
CTCTGCAGCC TGTGCTACGG CGACATGGAG 
TCGGCGGGAC GGGCTGCTCC TGCGTTTGGT 
CACCTCACCT CACCCACGCG AAAACCTTCC 
GTCCCTGAGT ATGGCTGCGT GGTGAACTTG 
CCCTGTAGAA GACGAGGCCC TGGGTGGCAC 
CCCACGGCCT ATTCCCCTGG TGCGGCCTGC 
GAGGTG CAGA GCGACTACTC CAGCTATGCC 
TCTCACCTTC AACCGCGGCT TCAAGGCTGG 
TCTTTGGGGT CTTGCGGCTG AAGTGTCACA 
GTGAACAGCC TCCAGACGGT GTGCACCAAC 
GCAGGCGTAC AGGTTTCACG CATGTGTGCT 
AAGTTTGGAA GAACCCCACA TTTTTCCTGC 
TCCCTCTGCT ACTCCATCCT GAAAGCCAAG 
GGCCAAGGGC GCCGCCGGCC CTCTGCCCTC 
GCCACCAAGC ATTCCTGCTC AAGCTGACTC 
CCACTCCTGG GGTCACTCAG GACAGCCCAG 
CCCGGGGACG ACGCTGACTG CCCTGGAGGC 
CCTCAGACTT CAAGACCATC CTGGACTGAT 
GCCGAGAG C A GACACCAGCA GCCCTGTCAC 
GAGGGAGGGG CGGCCCACAC CCAGGCCCGC 
TGAGTGAGTG TTTGGCCGAG GCCTGCATGT 
CGGCTGAGGC CTGAGCGAGT GTCCAGCCAA 
CTGCCGTCTT CACTTCCCCA CAGGCTGGCG 
AGCTTTTCCT CACCAGGAGC CCGGCTTCCA 
CATCCCCAGA TTCGCCATTG TTCACCCCTC 
CCACCCCCAC CATCCAGGTG GAGACCCTGA 
GGAATTTGGA GTGACCAAAG GTGTGCCCTG 
ACCTGGATGG GGGTCCCTGT GGGTCAAATT 
TAAAATACTG AATATATGAG TTTTTCAGTT 
AAAAAAAAA 
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GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCGATGCC 



1 + + + + + + 60

CGTCGCGACGCAGGACGACGCGTGCACCCTTCGGGACCGGGGCCGGTGGGGGCGCTACGG 

a AALRPAAHVGS PGPGHPRDA 

b QRCVLLRTWEALAPATPAMP- 

c SAASCCARGKPWPRPPPRCR- 

GCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCT 

61 + + + + + + 120

CGCGCGAGGGGCGACGGCTCGGCACGCGAGGGACGACGCGTCGGTGATGGCGCTCCACGA 

a ARSPLPSRALPAAQPLPRGA 

b RAPRCRAVRSLLRSHYREVL- 

c ALPAAEPCAPCCAATTARCC- 

GCCGCTGGCCACGTTCGTGCGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGG 

121 + + + + + + 180 

CGGCGACCGGTGCAAGCACGCCGCGGACCCCGGGGTCCCGACCGCCGACCACGTCGCGCC 

a AAGHVRAAPGAPGLAAGAAR 

b PLATFVRRLGPQGWRLVQRG- 

C RWPRS CGAWGPRAGGWCSAG- 

GGACCCGGCGGCTTTCCGCGCGNTGGTGGCCCANTGCNTGGTGTGCGTGCCCTGGGANGN

181 + + + + + + 240 

CCTGGGCCGCCGAAAGGCGCGCNACCACCGGGTNACGNACCACACGCACGGGACCCTNCN 

a GPGGFPR?GGP??GVRALG?

b DPAAFRA?VA?C?VCVPW??- 

c TRRLSARWWP?AWCACPG??-

ANGGCNGCCCCCCGCCGCCCCCTCCTTCCGCCAGGTGTCCTGCCTGAANGANCTGGTGGC 

241 + + + + + + 300 

TNCCGNCGGGGGGCGGCGGGGGAGGAAGGCGGTCCAGAGGACGGACTTNCTO 

a ?AAPRRPLLPPGVLPE??GG

b ??PPAAPSFRQVSCL??LVA- 

c G?PPPPPPSARCPA*??WWP- 

CCGAGTGCTGCANANGCTGTGCGANCGCGGCGCGAANAACGTGCTGGCCTTCGGCTTCGC 

301 + + + + + + 360 

GGCTCACGACGTNTNCGACACGCTNGCGCCGCGCTIT^TGCACGACCGGAAGCCGAAGCG 

a PSAA?AVR?RRE?RAGLRLR

b RVL??LC?RGA?NVLAFGFA- 

C ECC??CA?AAR? TCWPSASR- 

GCTGCTGGACGGGGCCCGCGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTA 

361 + + + + + + 420 

CGACGACCTGCCCCGGGCGCCCCCGGGGGGGCTCCGGAAGTGGTGGTCGCACGCGTCGAT 

a AAGRGPRGPPRGLHHQRAQL 

b LLDGARGGPPEAFTTSVRSY- 

C CWTGPAGAPPRPSPPACAAT- 

CCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCG 

421 + + + + + + 480 

GGACGGGTTGTGCCACTGGCTGCGTGACGCCCCCTCGCCCCGCACCCCCGACGACGACGC 

a PAQHGDRRTAGERGVGAAAA 

b LPNTVTDALRGSGAWGLLLR- 

C CPTR* PTHCGGAGRGGCCCA- 
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a PRGRRRAGSPAGTLR? ? C A G 

b RVGDDVLVHLLARCA? F V L V - 

C AWATTCWFTCWHAAR? LCWW- 

GGNTCCCAGCTGCGCCTACCANGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCNAC 

541 + + + + + + 600

CCNAGGGTCGACGCGGATGGTNCACACGCCCGGCGGCGACATGGTCGAGCCGCGACGNTG

a GSQLRLP?VRAAAVPARRC? 

b ?PSCAY?VCGPPLYQLGAAT- 

c ? PAAPT ? CAGRRCTSSAL ? L - 

TCAGGCCCGGCCCCCGCCACACGCTANTGGACCCGAANGCGTCTGGGATCCAACGGGCCT 

601 + + + + + + 660 

AGTCCGGGCCGGGGGCGGTGTGCGATNACCTGGGCTTNCGCAGACCCTAGGTTGCCCGGA 

a SGPAPATR? WTR?RLGSNGP 

b QARPPPHA?GPE?VWDPTGL 

C RPGPRHTL?DP?ASGIQRAW- 

GGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCTGCCAGCCCCGGGTGCGAGGAG 

661 + + + + + + 720 

CCTTGGTATCGCAGTCCCTCCGGCCCCAGGGGGACCCGACGGTCGGGGCCCACGCTCCTC 

a GTIASGRPGS PWAAS PGCEE 

b EP*RQGGRGPPGLPAPGARR

C NHSVREAGVPLGCQ PRVRGG- 

GCGCGGGGGCAGTGCCAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGC 

721 + + + + + + 780 

CGCGCCCCCGTCACGGTCGGCTTCAGACGGCAACGGGTTCTCCGGGTCCGCACCGCGACG 

a ARGQCQ PKSAVAQEAQAWRC 

b RGGSASRSLPLPKRPRRGAA- 

c AGAVPAEVCRC PRG PGVAL P- 

CCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCC 

781 + + + + + + 840 

GGGACTCGGCCTCGCCTGCGGGCAACCCGTCCCCAGGACCCGGGTGGGCCCGTCCTGCGG 

a P *AGADARWAGVLGP PGQDA 

b PEPERTPVGQGSWAHPGRTP- 

C LSRSGRPLGRGPGPTRAGRL- 

TGGACCGAGTGACCGTGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCAC 

841 + + + + + + 900 

ACCTGGCTCACTGGCACCAAAGACACACCACAGTGGACGGTCTGGGCGGCTTCTTCGGTG 

a WTE* PWFLCGVTCQTRRRSH 

b GPSDRGFCVVSPARPAEEAT- 

C DRVTVVSVWCHLPDPPKKPP- 

CTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCA 

901 + + + + + + 960 

GAGAAACCTCCCACGCGAGAGACCGTGCGCGGTGAGGGTGGGTAGGCACCCGGCGGTCGT 

a LFGGCALWHAPLPPI RGPPA 

b SLEGALSGTRHSHPSVGRQH- 

C LWRVRSLARATPTH PWAAST- 

CCACGCGGGCCCCCCATCCACATCGCGGCCACCACGTCCTGGGACACGCCTTGTCCCCCG 

961 + + + + + + 1020

GGTGCGCCCGGGGGGTAGGTGTAGCGCCGGTGGTGCAGGACCCTGTGCGGAACAGGGGGC 
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a PRGPPIHIAATTSWDTPCPP 

b HAGPPSTSRPPRPGTRLVPR- 

c TRAPHPHRGHHVLGHALSPG-

GTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGNA^ 

1021 + + + + + + 1080 

CACATGCGGCTCTGGTTCGTGAAGGAGATGAGGAGTC^ 

a VYAETKHFLYSSGDK? TA?L 

b CTPRPSTSSTPQAT?TLRPS- 

C VRRDQALPLLLRRQ ?HC? P P - 

CTTCCTACTCAATATATCTGAGGCCCAGCCTGACTGGCGTTCGGGAGGTTCGTGGAGACA 

1081 + + + + + + 1140 

GAAGGATGAGTTATATAGACTCCGGGTCGGACTGACCGCAAGCCCTCCAAGCACCTCTGT 

a LPTQYI *GPA*LAFGRFVET 

b FLLNISEAQPDWRSGGSWR? 

c SYS IYLRPSLTGVREVRGD?- 

NTCTTTCTGGTTCCAGGCCTTGGATGCCAGGATTCCCCGCAGGTTGCCCCGCCTGCCCCA 

1141 + + + + + + 1200 

NAGAAAGACCAAGGTCCGGAACCTACGGTCCTAAGGGGCGTCCAACGGGGCGGACGGGGT 

a ? FLVPGLGCQDSPQVAPPAP 

b SFWFQALDARIPRRLPRLPQ- 

c LSGSRPWMPGFPAGCPACPS- 

GCGNTACTGGCAAATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCC 

1201 + + + + + + 1260 

CGCNATGACCGTTTACGCCGGGGACAAAGACCTCGACGAACCCTTGGTGCGCGTCACGGG 

a A? LANAAPVSGAAWE PRAVP 

b RYWQMRPLFLELLGNHAQCP- 

c ? TGKCGPCFWSCLGTTRSAP- 

CTACGGGGTGTTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGG

1261 + + + + + + 1320 

GATGCCCCACAAGGAGTTCTGCGTGACGGGCGACGCTCGACGCCAGTGGGGTCGTCGGCC 

a LRGVPQDALPAASCGHPSSR 
b YGVFLKTHCPLRAAVTPAAG 
c TGCS SRRTARCELRS PQQPV- 

TGTCTGTGCCCGGGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGAACACAG 

1321 + + + + + + 1380 

ACAGACACGGGCCCTCTTCGGGGTCCCGAGACACCGCCGGGGGCTCCTCCTCCTTGTGTC 

a CLCPGEAPGLCGGPRGGGTQ 

b VCAREKPQGSVAAP EEEEHR 

C SVPGRS PRALWRPPRRRNTD- 

ACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCT 

1381 + + + + + + 1440 

TGGGGGCAGCGGACCACGTCGACGAGGCGGTCGTGTCGTCGGGGACCGTCCACATGCCGA 

a TPVAWCSCSASTAAPGRCTA 
b PPSPGAAAPPAQQPLAGVRL-
C PRRLVQLLRQHS S PWQVYGF- 

TCGTGCGGGCCTGCCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACG

1441 + + + + + + 1500 

AGCACGCCCGGACGGACGCGGCCGACCACGGGGGTCCGGAGACCCCGAGGTCCGTGTTGC 
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a S CGPACAGWCPQASGAPGTT 

b RAGLPAPAGAPRPLGLQAQR- 

c VRACLRRLVPPGLWGSRHNE- 

AACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCT 

1501 + + + + + + 1560 

TTGCGGCGAAGGAGTCCTTGTGGTTCTTCAAGTAGAGGGACCCCTTCGTACGGTTCGAGA 

a NAASSGTPRSSSPWGSMPSS 

b TPLPQEHQEVHLPGEACQAL 

C RRFLRNTKKF I SLGKHAKLS- 

CGCTGCAGGAGCTGACGTGGAAGATGAGCGTGCGGGA 

1561 + + + + + + 1620 

GCGACGTCCTCGACTGCACCTTCTACTCGCACGCCCTGACGCGAACCGACGCGTCCTCGG 

a RCRS * RGR*ACGTALGCAGA 

b AAGADVEDERAGLRLAAQEP- 

c LQEL TWKMSVRDCAWLRRSP- 

CAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGT 

1621 + + + + + + 1680 

GTCCCCAACCGACACAAGGCCGGCGTCTCGTGGCAGACGCACTCCTCTAGGACCGGTTCA 

a QGLAVFRPQSTVCVRRSWPS 

b RGWLCSGRRAPSA*GDPGQV 

C GVGCVPAAEHRLREE ILAKF- 

TCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCA

1681 + + + + + + 1740 

AGGACGTGACCGACTACTCACACATGCAGCAGCTCGACGAGTCCAGAAAGAAAATACAGT 

a SCTG**VCTSSSCSGLSFMS 

b PALADECVRRRAAQVFLLCH 

C LHWLMSVYVVELLRSFFYVT- 

CGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGT

1741 + + + + + + 1800 

GCCTCTGGTGCAAAGTTTTCTTGTCCGAGAAAAAGATGGCCTTCTCACAGACCTCGTTCA 

a RRPRFKRTGSFSTGRVSGAS 

b GDHVSKEQALFLPEECLEQV

C ETTFQKNRLF FYRKSVWSKL- 

TGCAAAGCATTGGAATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGC^ 

1801 + + + + + + 1860 

ACGTTTCGTAACCTTAGTCTGTCGTGAACTTCTCCCACGTCGACGCCCTCGACAGCCTT^ 

a CKALESDST*RGCSCGSCRK 

b AKHWNQTALEEGAAAGAVGS 

C QSIGIRQHLKRVQLRELSEA- 

CAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCA 

1861 + + + + + + 1920 

GTCTCCAGTCCGTCGTAGCCCTTCGGTCCGGGCGGGACGACTGCAGGTCTGAGGCGAAGT 

a QRSGSIGKPGPPC*RPDSAS 

b RGQAASGSQARPADVQTPLH- 

C EVRQHREARPALLTSRLRFI- 

TCCCCAAGCCTGACGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAA 

1921 + + + + + + 1980 

AGGGGTTCGGACTGCCCGACGCCGGCTAACACTTGTACCTGATGCAGCACCCTCGGTCTT 
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SPSLTGCGRL*TWTTSWEPE 
P Q A * RAAADCEHGLRRGSQN 
PKPDGLRP I VNMDYVVGART- 

CGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCT 

1981 + + + + + + 2040 

GCAAGGCGTCTCTTTTCTCCCGGCTCGCAGAGTGGAGCTCCCACTTCCGTGACAAGTCGC 

RSAEKRGPSVSPRG* RHCSA 
VPQRKEGRASHLEGEGTVQR 
FRREKRAERLTSRVKALFSV- 

TGCTCAACTACGAGCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGG 

2041 + + + + + + 2100 

ACGAGTTGATGCTCGCCCGCGCCGCGGGGCCGGAGGACCCGCGGAGACACGACCCGGACC 

CSTTSGRGAPASWAPLCWAW 
AQLRAGAAPRPPGRLCAGPG 
LNYERARRPGLLGASVLGLD- 

ACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGC 

2101 + + + + + + 2160 

TGCTATAGGTGTCCCGGACCGCGTGGAAGCACGACGCACACGCCCGGGTCCTGGGCGGCG 

TISTGPGAPSCCVCGPRTRR 
RYPQGLAHLRAACAGPGPAA 
D IHRAWRTFVLRVRAQDPP P - 

CTGAGCTGTACTTTGTCAAGGTGGATGTGACGGGCGCGTACGACACCATCCCCCAGGACA 

2161 + + + + + + 2220 

GACTCGACATGAAACAGTTCCACCTACACTGCCCGCGCATGCTGTGGTAGGGGGTCCTGT 

LSCTLSRWM*RARTTPSPRT 
*AVLCQGGCDGRVRHHPPGQ - 
ELYFVKVDVTGAYDTI PQDR- 

GGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGT 

2221 + + + + + + 2280 

CCGAGTGCCTCCAGTAGCGGTCGTAGTAGTTTGG 

GSRRSS PASSNPRTRTACVG 
AHGGHRQHHQTPEHVLRASV 
LTEVIASI IKPQNTYCVRRY- 

ATGCCGTGGTCGAGAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCACGTCT 

2281 + + + + + + 2340 

TACGGCACCAGGTCTTCCGGCGGGTACCCGTGCAGGCGTTCCGGAAGTTCTCGGTGCAGA 

MPWSRRPPMGTSARPSRATS 
CRGPEGRPWARPQGLQEPRL 
AVVQKAAHGHVRKAFKSHVS- 

CTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAG^ 

2341 + + + + + + 2400 

GATGGAACTGTCTGGAGGTCGGCATGTACGCTGTC 

LP*QTSSRTCDSSWLTCR?T 
YLDRPPAVHATVRGSPAG?Q- 
TLTDLQ PYMRQFVAHLQ ?NS- 

GCCCGCTGAGGGATGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCCAGCAGTG 

2401 + + + + 4- + 2460 

CGGGCGACTCCCTACGGCAGCAGTAGCTCGTCTCGAGGAGGGACTTACTCCGGTCGTCAC 
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AR*GMPSSSSRAPP*MRPAV 
PAEGCRRHRAELLPE* GQQW- 
PLRDAVVI EQSSSLNEASSG- 

GCCTCTTCGACGTCTTCCTACGCTTCATGT 

2461 + + + + + + 2520 

CGGAGAAGCTGCAGAAGGATGCGAAGTACACGGTGGTGCGGCACGCGTAGTCCCCGTTCA 

ASSTSSYASCATTPCASGAS 
PLRRL PTLHVPPRRAHQGQV- 
LFDVFLRFMCHHAVR I RGKS- 

CCTACGTCCAGTGCCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCC 

2521 + + + + + + 2580 

GGATGCAGGTCACGGTCCCCTAGGGCGTCCCGAGGTAGGAGAGGTGCGACGAGACGTCGG 

PTSSARGSRRAPSSPRCSAA 
LRPVPGDPAGLHPLHAALQP 
YVQCQGI PQGS I LSTLLCS L - 

TGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCC

2581 + + + + + + 2640 

ACACGATGCCGCTGTACCTCTTGTTCGACAAACGCCCCTAAGCCGCCCTGCCCGACGAGG 

CATATWRTSCLRGFGGTGCS 
VLRRHGEQAVCGDSAGRAAP - 
CYGDMENKLFAG I RRDGLLL- 

TGCGTTTGGTGGATGATTTCTTGTT^ 

2641 + + + + + + 2700 

ACGCAAACCACCTACTAAAGAACAACCACTGTGGA^ 

CVWWMISCW*HLTS PTRKPS 
AFGG* FLVG DTS PH PRENL P 
RLVDDFLLVTPHLTHAKTFL- 

TCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAG 

2701 + + + + + + 2760

AGTCCTGGGACGAGGCTCCACAGGGACTCATACCGACGCACC^CTTGAACGCCTTCTGTC 

SGPWSEVSLSMAAW*TCGRQ 
QDPGPRCP*VWLRGELAEDS 
RTLVRGVPEYGCVVNLiRKTV- 

TGGTGAACTTCCCTGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGT^ 

2761 + + + + + + 2820 

ACCACTTGAAGGGACATCTTCTGCTCCGGGACCCACCGTGCCGAAAACAAGTCTA 

W*TSL* KTRPWVARLLFRCR 
GELPCRRRGPGWHGFCSDAG- 
VN F PVEDEALGGTAFVQMPA- 

CCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGGTGCAGA 

2821 + + + + + + 2880 

GGGTGCCGGATAAGGGGACCACGCCGGACGACGACCTATGGGCCTGGGACCTCCACGTCT 

PTAYSPGA ACCWIPGPWRCR 
PRPI PLVRPAAGYPDPGGAE 
HGLFPWCGLLLDTRTLEVQS- 

GCGACTACTCCAGCTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTCAACCGCGGCT

2881 + + + + + + 2940 

CGCTGATGAGGTCGATACGGGCCTGGAGGTAGTCTCGGTG 
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a ATTPAMPGPPSEPVSPSTAA 

b RLLQLCPDLHQSQSHLQPRL- 

C DYSSYARTS IRASLTFNRGF- 

TCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCT^ 

2941 + + + + + + 3000 

AGTTCCGACCCTCCTTGTACGCAGCGTTTGAGAAACCCCAGAACGCCGACTTCACAGTGT 

a SRLGGTCVANSLGSCG* SVT 

b QGWEEHASQTLWGLAAEVSQ- 

C KAGRNMRRKLFGVLRLKCHS- 

GCCTGTTTCTGGATTTGCAGGTGAAC^^ 

3001 + + + + + + 3060 

CGGACAAAGACCTAAACGTCCACTTGTCGGAGGTCTGC 

a ACFWICR*TASRRCAPTSTR 

b PVSGFAGEQPPDGVHQHLQD- 

C LFLDLQVNSLQTVCTNIYKI- 

TCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCATTTCATCAGC

3061 + + + + + + 3120 

AGGAGGACGACGTCCGCATGTCCAAAGTGCGTACACACGACGTCGAGGGTAAAGTAGTCG 

a SSCCRRTGFTHVCCSSHFIS 

b PPAAGVQVSRMCAAAPISSA- 

c LLLQAYRFHACVLQL PFHQQ- 

AAGTTTGGAAGAACCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCCTCCCTCTGCT 

3121 + + + + + + 3180 

TTCAAACCTTCTTGGGGTGTAAAAAGGACGCGCAGTAGAGACTGTGCCGGAGGGAGACGA 

a KFGRTPHFSCASSLTRPPSA 

b SLEEPHIFPARHL*HGLPLL- 

c VWKNPTFFLRVI SDTASLCY- 

ACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGCGCCGCCGGCC 

3181 + + + + + + 3240 

TGAGGTAGGACTTTCGGTTCTTGCGTCCCTACAGCGACCCCCGGTTCCCGCGGCGGCCGG 

a TPS * KPRTQGCRWGPRAPPA 

b LHPESQERRDVAGGQGRRRP- 

c S ILKAKNAGMSLGAKGAAGP- 

CTCTGCCCTCCGAGGCCGTGCAGTGGCTGTGCCACCAAGGATTCCTGCTCAAGCTG 

3241 + + + + + + 3300 

GAGACGGGAGGCTCCGGCACGTCACCGACACGGTGGTTCGTAAGGACGAGTTCGACTGAG 

a LCPPRPCSGCATKHSCSS*L 

b SALRGRAVAVPPS I PAQADS 

C LPSEAVQWLCHQAFLLKLTR- 

GACACCGTGTCACCTACGTGCCACTCCTGGGGTCACTCAGGACAGCCCAGACGCAGCTGA 

3301 + + + + + + 3360 

CTGTGGCACAGTGGATGCACGGTGAGGACCCCAGTGAGTCCTGTCGGGTCTGCGTCGACT 

a DTVSPTCHSWGHSGQPRRS * 

b TPCHLRATPGVTQDSPDAAE 

C HRVTYVPLLGSLRTAQTQLS- 

GTCGGAAGCTCCCGGGGACGACGCTGACTGCCCTGGAGGCCGCAGCCAACCCGGCACTGC 

3361 + + + + + + 3420 

CAGCCTTCGAGGGCCCCTGCTGCGACTGACGGGACCTCCGGCGTCGGTTGGGCCGTGACG 
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VGSSRGRR* LPWRPQPTRHC 



b SEAPGDDADCPGGRSQPGTA- 
C RKL PGTTLTALEAAAN PAL P - 

CCTCAGACTTCAAGACCATCCTGGACTGATGGCCACCCGCCCACAGCCAGGCCGAGAGCA

3421 + + + + + + 3480 

GGAGTCTGAAGTTCTGGTAGGACCTGACTACCGGTGGGCGGGTGTCGGTCCGGCTCTCGT 

a PQTSRPSWTDGHPPTARPRA 

b LRLQDHPGLMATRPQPGREQ- 

C SDFKTILD*WPPAHSQAESR- 

GACACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACAC 

3481 + + + + + + 3540 

CTGTGGTCGTCGGGACAGTGCGGCCCGAGATGCAGGGTCCCTCCCTCCCCGCCGGGTGTG 

a DTSS PVTPGSTSQGGRGGPH 

b TPAALSRRALRPREGGAAHT- 

c HQQPCHAGLYVPGREGRPTP- 

CCAGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGT 

3541 + + + + + + 3600 

GGTCCGGGCGTGGCGACCCTCAGACTCCGGACTCACTCACAAACCGGCTCCGGACGTACA 

a PGPHRWESEA*VSVWPRPAC 

b QARTAGSLRPE*VFGRGLHV- 

C RPAPLGV* GLSE CLAEACMS- 

CCGGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTG 

3601 + + + + + + 3660 

GGCCGACTTCCGACTCACAGGCCGACTCCGGACTCGCTCACAGGTCGGTTCCCGACTCAC 

a PAEG*VSG*GLSECPAKG*V 

b RLKAECPAEA*ASVQPRAEC - 

c G* RLSVRLRPERVSSQGLSV- 

TCCAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCC 

3661 + + + + + + 3720 

AGGTCGTGTGGACGGCAGAAGTGAAGGGGTGTCCGACCGCGAGCCGAGGTGGGGTCCCGG 

a SSTPAVFTSPQAGARLHPRA 

b PAHLPSSLPHRLALGSTPGP 

C QHTCRLHFPTGWRSAPPQGQ- 

AGCTTTTCCTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGA 

3721 + + + + + + 3780 

TCGAAAAGGAGTGGTCCTCGGGCCGAAGGTGAGGGGTGTATCCTTATCAGGTAGGGGTCT 

a SFSSPGARLPLPT*E*SIPR 

b AFPHQEPGFHSPHRNSPSPD- 

c LFLTRS PASTPH IGIVHPQI- 

TTCGCCATTGTTCACCCCTCGCCCTGCCCTCCTTTGCCTTCCACCCCCACCATC 

3781 + + + + + + 3840 

AAGCGGTAACAAGTGGGGAGCGGGACGGGAGGAAACGGAAGGTGGGGGTGGTAGGTCCAC 

a FAIVHPSPCPPLPSTPTIQV 

b SPLFTPRPALLCLPPPPSRW- 

C RHCSPLALPSFAFHPHHPGG- 

GAGACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTT^ 

3841 + + + + + + 3900 

CTCTGGGACTCTTCCTGGGACCCTCGAGACCCTTAAACC 



FIG. 51H 




a ETLRRTLGALGIWSDQRCAL 

b RP*EGPWELWEFGVTKGVPC- 

c DPEKDPGSSGNLE*PKVCPV- 

TACACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGG 

3901 + + + + + + 3960 

ATGTGTCCGCTCCTGGGACGTGGACCTACCCCCAGGGACACC<^GTTTAACCCCCCTCCA 

a YTGEDPAPGWGSLWVKLGGG 

b TQARTLHLDGGPCGSNWGEV- 

c HRRG PCTWMGVPVGQ I GGRC- 

GCTGTGGGAGTAAAATACTGAATATATGAGTTT^ 

3961 + + + + + + 4020 

CGACACCCTCATTTTATGACTTATATACT 

a AVGVKY* IYEFFSFEKKKKK 

b LWE*NTEYMSFSVLKKKKKK- 

C CGSKILNI * V F Q F * KKKKKK- 

AAAAAAAAA 

4021 4029 

TTTTTTTTT 

a K K K - 

b K K 

c K K 

FIG. 51I
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i 

met 

GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 

10 

pro arg ala pro arg cys arg ala val arg ser leu leu arg ser 

CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC 

20 30 

his tyr arg glu val leu pro leu ala thr phe val arg arg leu 

CAC TAC CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG 

40 

gly pro gln gly trp arg leu val gln arg gly asp pro ala ala

GGG CCC CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT 

50 60 

phe arg ala leu val ala gln cys leu val cys val pro trp asp

TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC 

70 

ala arg pro pro pro ala ala pro ser phe arg gln val ser cys

GCA CGG CCG CCC CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC 

80 90 

leu lys glu leu val ala arg val leu gln arg leu cys glu arg

CTG AAG GAG CTG GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC 

100 

gly ala lys asn val leu ala phe gly phe ala leu leu asp gly 

GGC GCG AAG AAC GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG 

110 120 

ala arg gly gly pro pro glu ala phe thr thr ser val arg ser 

GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC 

130 

tyr leu pro asn thr val thr asp ala leu arg gly ser gly ala 

TAC CTG CCC AAC ACG GTG ACC GAC GCA CTG CGG GGG AGC GGG GCG 

140 150 

trp gly leu leu leu arg arg val gly asp asp val leu val his 

TGG GGG CTG CTG CTG CGC CGC GTG GGC GAC GAC GTG CTG GTT CAC 

160 

leu leu ala arg cys ala leu phe val leu val ala pro ser cys 

CTG CTG GCA CGC TGC GCG CTC TTT GTG CTG GTG GCT CCC AGC TGC 

170 180 

ala tyr gln val cys gly pro pro leu tyr gln leu gly ala ala

GCC TAC CAG GTG TGC GGG CCG CCG CTG TAC CAG CTC GGC GCT GCC 

190 

thr gln ala arg pro pro pro his ala ser gly pro arg arg arg

ACT CAG GCC CGG CCC CCG CCA CAC GCT AGT GGA CCC CGA AGG CGT 
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leu 
CTC 


tyr 
TAC 


ser 
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glu 
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arg 
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arg 
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leu 
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phe 
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leu 
CTG 


val 
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leu 
CTC 
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CTC 



00 

glu arg ala 
GAA CGG GCC 



gly leu pro 
GGC CTG CCA 

230 

arg ser leu 
CGA AGT CTG 



glu pro glu 
GAG CCG GAG 

260 

gly arg thr 
GGC AGG ACG 



pro ala arg 
CCT GCC AGA 
290 

ser gly thr 
TCT GGC ACG 



ala gly pro 
GCG GGC CCC 

320 

pro cys pro 
CCT TGT CCC 



ser gly asp 
TCA GGC GAC 

350 

leu arg pro 
CTG AGG CCC 



phe leu gly 
TTT CTG GGT 

380 

pro arg leu 
CCC CGC CTG 



glu leu leu 
GAG CTG CTT 

410 

lys thr his 
AAG ACG CAC 
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trp asn his 
TGG AAC CAT 

220 

ala pro gly 
GCC CCG GGT 



pro leu pro 
CCG TTG CCC 

250 

arg thr pro 
CGG ACG CCC 



arg gly pro 
CGT GGA CCG 

280 

pro ala glu 
CCC GCC GAA 

arg his ser 
CGC CAC TCC 

310 

pro ser thr 
CCA TCC ACA 



pro val tyr 
CCG GTG TAC 

340 

lys glu gln
AAG GAG CAG 



ser leu thr 
AGC CTG ACT 

370 

ser arg pro 
TCC AGG CCC 



pro gln arg
CCC CAG CGC 

400 

gly asn his 
GGG AAC CAC 



cys pro leu 
TGC CCG CTG 



ser val arg 
AGC GTC AGG 



ala arg arg 
GCG AGG AGG 



lys arg pro 
AAG AGG CCC 



val gly gln
GTT GGG CAG 



ser asp arg 
AGT GAC CGT 



glu ala thr 
GAA GCC ACC 

his pro ser 
CAC CCA TCC 



ser arg pro 
TCG CGG CCA 



ala glu thr 
GCC GAG ACC 



leu arg pro 
CTG CGG CCC 



gly ala arg 
GGC GCT CGG 



trp met pro 
TGG ATG CCA 



tyr trp gln
TAC TGG CAA 



ala gln cys
GCG CAG TGC 



arg ala ala 
CGA GCT GCG 



210 
glu ala gly 
GAG GCC GGG 



arg gly gly 
CGC GGG GGC 

240 

arg arg gly 
AGG CGT GGC 



gly ser trp 
GGG TCC TGG 

270 
gly phe cys 
GGT TTC TGT 



ser leu glu 
TCT TTG GAG 
300 

val gly arg 
GTG GGC CGC 



pro arg pro 
CCA CGT CCC 

330 
lys his phe 
AAG CAC TTC 



ser phe leu 
TCC TTC CTA 

360 
arg leu val 
AGG CTC GTG 



gly thr pro 
GGG ACT CCC 

390 
met arg pro 
ATG CGG CCC 



pro tyr gly 
CCC TAC GGG 

420 
val thr pro 
GTC ACC CCA 
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ala ala gly 
GCA GCC GGT 



ala pro glu 
GCC CCC GAG 



leu arg gln
CTC CGC CAG 



ala cys leu 
GCC TGC CTG 



his asn glu 
CAC AAC GAA 



leu gly lys 
CTG GGG AAG 



met ser val 
ATG AGC GTG 



gly cys val 
GGC TGT GTT 



ala lys phe 
GCC AAG TTC 



leu arg ser 
CTC AGG TCT 



arg leu phe 
AGG CTC TTT 



ile gly ile 
ATT GGA ATC 

ser glu ala 
TCG GAA GCA 



leu thr ser 
CTG ACG TCC 



pro ile val 
CCG ATT GTG 



val cys ala 
GTC TGT GCC 

440 

glu glu asp 
GAG GAG GAC 



his ser ser 
CAC AGC AGC 

470 

arg arg leu 
CGC CGG CTG 



arg arg phe 
CGC CGC TTC 

500 

his ala lys 
CAT GCC AAG 



arg asp cys 
CGG GAC TGC 

530 

pro ala ala 
CCG GCC GCA 



leu his trp 
CTG CAC TGG 

560 

phe phe tyr 
TTC TTT TAT 



phe tyr arg 
TTC TAC CGG 

590 

arg gln his
AGA CAG CAC 

glu val arg 
GAG GTC AGG 

620 

arg leu arg 
AGA CTC CGC 



asn met asp 
AAC ATG GAC 



430 



arg 
CGG 


glu 

GAG 


lys 
AAG 


thr 
ACA 


asp 
GAC 


pro 
CCC 


pro 
CCC 


trp 
TGG 


460 
gln
CAG 


val 
GTG 


pro 
CCC 


pro 
CCA 


leu 
CTC 


arg 
AGG 


490 
asn 
AAC 


leu 
CTC 


ser 
TCG 


leu 
CTG 


ala 
GCT 


trp 
TGG 


520 
leu 
CTG 


glu 
GAG 


his 
CAC 


arg 
CGT 


leu 
CTG 


met 
ATG 


550 
ser 
AGT 


val 
GTC 


thr 
ACG 


glu 
GAG 


lys 
AAG 


ser 
AGT 


580 
val 
GTC 


leu 
TTG 

gln
CAG 


lys 
AAG 

his 
CAT 


arg 
AGG 
610 
arg 
CGG 


phe 
TTC 


ile 
ATC 


pro 
CCC 


tyr 
TAC 


val 
GTC 


640 
val 
GTG 



pro gln gly
CCC CAG GGC 



arg arg leu 
CGT CGC CTG 



val tyr gly 
GTG TAC GGC 



gly leu trp 
GGC CTC TGG 



thr lys lys 
ACC AAG AAG 



gln glu leu
CAG GAG CTG 



arg arg ser 
CGC AGG AGC 



leu arg glu 
CTG CGT GAG 



val tyr val 
GTG TAC GTC 



thr thr phe 
ACC ACG TTT 



trp ser lys 
TGG AGC AAG 



val gln leu
GTG CAG CTG 

glu ala arg 
GAA GCC AGG 



lys pro asp 
AAG CCT GAC 



gly ala arg 
GGA GCC AGA 



ser val ala 
TCT GTG GCG 

450 
val gln leu
GTG CAG CTG 



phe val arg 
TTC GTG CGG 

480 
gly ser arg 
GGC TCC AGG 



phe ile ser 
TTC ATC TCC 

510 
thr trp lys 
ACG TGG AAG 



pro gly val 
CCA GGG GTT 

540 
glu ile leu 
GAG ATC CTG 



val glu leu 
GTC GAG CTG 

570 

gln lys asn
CAA AAG AAC 



leu gln ser
TTG CAA AGC 

600 
arg glu leu 
CGG GAG CTG 

pro ala leu 
CCC GCC CTG 

630 
gly leu arg 
GGG CTG CGG 



thr phe arg 
ACG TTC CGC
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arg glu lys 
AGA GAA AAG 



phe ser val 
TTC AGC GTG 



gly ala ser 
GGC GCC TCT 



thr phe val 
ACC TTC GTG 



tyr phe val 
TAC TTT GTC 



gln asp arg
CAG GAC AGG 



asn thr tyr 
AAC ACG TAC 



his gly his 
CAT GGG CAC 



thr asp leu 
ACA GAC CTC 



glu thr ser 
GAG ACC AGC 



ser leu asn 
TCC CTG AAT 



phe met cys 
TTC ATG TGC 



gln cys gln
CAG TGC CAG 



cys ser leu 
TGC AGC CTG 



ile arg arg 
ATT CGG CGG 



50 

arg ala glu 
AGG GCC GAG 



leu asn tyr 
CTC AAC TAC 

680 

val leu gly 
GTG CTG GGC 



leu arg val 
CTG CGT GTG 

710 

lys val asp 
AAG GTG GAT 



leu thr glu 
CTC ACG GAG 

740 

cys val arg 
TGC GTG CGT 



val arg lys 
GTC CGC AAG 

770 

gln pro tyr
CAG CCG TAC 



pro leu arg 
CCG CTG AGG 

800 

glu ala ser 
GAG GCC AGC 



his his ala 
CAC CAC GCC 

830 

gly ile pro 
GGG ATC CCG 



cys tyr gly 
TGC TAC GGC 

860 

asp gly leu 
GAC GGG CTG 
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arg leu thr 
CGT CTC ACC 

670 

glu arg ala 
GAG CGG GCG 



leu asp asp 
CTG GAC GAT 

700 

arg ala gln
CGG GCC CAG 



val thr gly 
GTG ACG GGC 

730 

val ile ala 
GTC ATC GCC 



arg tyr ala 
CGG TAT GCC 

760 

ala phe lys 
GCC TTC AAG 



met arg gln
ATG CGA CAG 

790 

asp ala val 
GAT GCC GTC 



ser gly leu 
AGT GGC CTC 

820 

val arg ile 
GTG CGC ATC 



gln gly ser
CAG GGC TCC 

850 

asp met glu 
GAC ATG GAG 



leu leu arg 
CTC CTG CGT 



ser arg val 
TCG AGG GTG 



arg arg pro 
CGG CGC CCC 



ile his arg 
ATC CAC AGG 



asp pro pro 
GAC CCG CCG 



ala tyr asp 
GCG TAC GAC 



ser ile ile 
AGC ATC ATC 



val val gln
GTG GTC CAG 



ser his val 
AGC CAC GTC 



phe val ala 
TTC GTG GCT 



val ile glu 
GTC ATC GAG 



phe asp val 
TTC GAC GTC 



arg gly lys 
AGG GGC AAG 



ile leu ser 
ATC CTC TCC 



asn lys leu 
AAC AAG CTG 



leu val asp 
TTG GTG GAT 



660 
lys ala leu 
AAG GCA CTG 



gly leu leu 
GGC CTC CTG 

690 
ala trp arg 
GCC TGG CGC 



pro glu leu 
CCT GAG CTG 

720 

thr ile pro 
ACC ATC CCC 



lys pro gln
AAA CCC CAG 

750 
lys ala ala 
AAG GCC GCC 



ser thr leu 
TCT ACC TTG 

780 

his leu gln
CAC CTG CAG 



gln ser ser
CAG AGC TCC 

810 

phe leu arg 
TTC CTA CGC 



ser tyr val 
TCC TAC GTC 

840 

thr leu leu 
ACG CTG CTC 



phe ala gly 
TTT GCG GGG 

870 
asp phe leu 
GAT TTC TTG 
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880 
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1070 
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phe 
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1100 



1110 



arg thr ala gln thr gln leu ser arg lys leu pro gly thr thr

AGG ACA GCC CAG ACG CAG CTG AGT CGG AAG CTC CCG GGG ACG ACG 

1120 

leu thr ala leu glu ala ala ala asn pro ala leu pro ser asp 

CTG ACT GCC CTG GAG GCC GCA GCC AAC CCG GCA CTG CCC TCA GAC 



phe lys thr ile leu asp OP 

TTC AAG ACC ATC CTG GAC TGA TGGCCACCCGCCCACAGCCAGGCCGAGAGCAGA 

CACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACACCC 

AGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCC 

GGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTC 

CAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAG 

CTTTTCYTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGATT 

CGCCATTGTTCACCCYTCGCCCTGCCYTCCTTTGCCTTCCACCCCCACCATCCAGGTGGA 

GACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTA 

CACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGC 

TGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGRAAAAAAAAAAAAAAAAAA 

AAAAAAAAAA 
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